Novel poly(ADP-ribose) polymerase genes

The present invention relates to novel poly(ADP-ribose) polymerase
(PARP) genes and to the proteins derived therefrom; antibodies with
specificity for the novel proteins; pharmaceutical and gene therapy
compositions which comprise products according to the invention;
methods for the analytical determination of the proteins and nucleic
acids according to the invention; methods for identifying effectors
or binding partners of the proteins according to the invention;
methods for determining the activity of such effectors and use
thereof for the diagnosis or therapy of pathological states.

In 1966, Chambon and co-workers discovered a 116 kDa enzyme that was
characterized in detail in subsequent years and is now called PARP
(EC 2.4.2.30) (poly(adenosine-5'-diphosphoribose) polymerase), PARS
(poly(adenosine-5'-diphosphoribose) synthase) or ADPRT
(adenosine-5'-diphosphoribose transferase). This enzyme has to date
been unique in its activity, which is described below. It is
referred to as PARP1 below to avoid ambiguity.

The primary physiological function of PARP1 appears to be its
involvement in a complex repair mechanism which cells have developed
to repair DNA strand breaks. The primary cellular response to a DNA
strand break appears moreover to consist of PARP1-catalyzed
synthesis of poly(ADP-ribose) from NAD+ (cf. De Murcia, G. et al.
(1994) TIBS, 19, 172).

PARP1 has a modular molecular structure. Three main functional
elements have been identified to date: an N-terminal 46 kDa DNA
binding domain; a central 22 kDa automodification domain to which
poly(ADP-ribose) becomes attached, with the PARP1 enzyme activity
decreasing with increasing elongation; and a C-terminal 54 kDa NAD+
binding domain. A leucine zipper region, indicating possible
protein-protein interactions, has been found within the
automodification domain only in the PARP from Drosophila. All PARPs
known to date are presumably active as homodimers.

The high degree of organization of the molecule is reflected again
in the strong conservation of the amino acid sequence. Thus, 62%
conservation of the amino acid sequence has been found for PARP1
from humans, mice, cattle and chickens. There are greater structural
differences from the PARP from Drosophila. The individual domains
themselves in turn have clusters of increased conservation. Thus,
the DNA binding region contains two so-called zinc fingers as
subdomains (comprising motifs of the type CX2CX28/30HX2C), which are
involved in the Zn2+-dependent recognition of DNA single strand
breaks or single-stranded DNA overhangs (e.g. at the chromosome
ends, the telomeres). The C-terminal catalytic domain comprises a
block of about 50 amino acids (residues 859-908), which is 100%
conserved among vertebrates (PARP "signature"). This block binds the
natural substrate NAD+ and thus governs the synthesis of
poly(ADP-ribose) (cf. de Murcia, loc. cit.). The GX3GKG motif in
particular is characteristic of PARPs in this block.

The beneficial function described above contrasts with a
pathological one in numerous diseases (stroke, myocardial infarct,
sepsis etc.). PARP is involved in cell death resulting from ischemia
of the brain (Choi, D.W. (1997) Nature Medicine, 3, 10, 1073), of
the myocardium (Zingarelli, B., et al. (1997), Cardiovascular
Research, 36, 205) and of the eye (Lam, T.T. (1997), Res. Comm. in
Molecular Pathology and Pharmacology, 95, 3, 241). PARP activation
induced by inflammatory mediators has also been observed in septic
shock (Szabo, C., et al. (1997), Journal of Clinical Investigation,
100, 3, 723). In these cases, activation of PARP is accompanied by
extensive consumption of NAD+. Since four moles of ATP are consumed
for the biosynthesis of one mole of NAD+, the cellular energy supply
decreases drastically. The consequence is cell death.

PARP1 inhibitors described in the abovementioned specialist
literature are nicotinamide and 3-aminobenzamide.
3,4-Dihydro-5-[4-(1-piperidinyl)butoxy]-1(2H)-isoquinolone is
disclosed by Takahashi, K., et al. (1997), Journal of Cerebral Blood
Flow and Metabolism, 17, 1137. Further inhibitors are described, for
example, in Banasik, M., et al. (1992) J. Biol. Chem., 267, 3, 1569
and Griffin, R.J., et al. (1995), Anti-Cancer Drug Design, 10, 507.

High molecular weight binding partners described for human PARP1
include the base excision repair (BER) protein XRCC1 (X-ray repair
cross-complementing 1), which binds via a zinc finger motif and a
BRCT (BRCA1 C terminus) module (amino acids 372-524) (Masson, M., et
al. (1998) Molecular and Cellular Biology, 18, 6, 3563).

It is an object of the present invention, because of the diverse
physiological and pathological functions of PARP, to provide novel
PARP homologs. The reason for this is that the provision of
homologous PARPs would be particularly important for developing
novel targets for drugs, and novel drugs, in order to improve
diagnosis and/or therapy of pathological states in which PARP, PARP
homologs or substances derived therefrom are involved.

We have found that this object is achieved by providing PARP
homologs having an amino acid sequence which has

a) a functional NAD+ binding domain, i.e. a PARP "signature"
   sequence having the characteristic GX3GKG motif;

and

b) especially in the N-terminal sequence region, i.e. in the region
   of the first 200, such as, for example, in the region of the
   first 100, N-terminal amino acids, no PARP zinc finger sequence
   motifs of the general formula

   CX2CXmHX2C

   in which m is an integral value of 28 or 30, and the X radicals
   are, independently of one another, any amino acid;

and the functional equivalents thereof.
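The two sequence criteria a) and b) can be illustrated by a minimal
Python sketch. It only tests for the presence of the GX3GKG motif and
for the absence of the zinc finger motif CX2CXmHX2C (m = 28 or 30) in
the N-terminal region; it says nothing about whether the NAD+ binding
domain is functional. The function name, the default window of 200
residues taken from b), and the one-letter amino acid input are
assumptions made for illustration only.

```python
import re

# Zinc finger motif C-X2-C-Xm-H-X2-C with m = 28 or 30 (X = any amino acid).
ZINC_FINGER = re.compile(r"C.{2}C(?:.{28}|.{30})H.{2}C")

# Core of the PARP "signature": G-X3-G-K-G.
GKG_MOTIF = re.compile(r"G.{3}GKG")


def matches_sequence_criteria(seq, n_term_window=200):
    """Return True if seq contains GX3GKG and has no zinc finger motif
    lying entirely within the first n_term_window residues.

    Hypothetical helper: a motif screen only, not a test of function.
    """
    seq = seq.upper()
    has_signature_core = GKG_MOTIF.search(seq) is not None
    has_n_terminal_zinc_finger = ZINC_FINGER.search(seq[:n_term_window]) is not None
    return has_signature_core and not has_n_terminal_zinc_finger
```

A sequence passing this screen would still have to be examined
experimentally for poly(ADP-ribose)-synthesizing activity.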

Since the PARP molecules according to the invention represent in
particular functional homologs, they naturally also have a
poly(ADP-ribose)-synthesizing activity. The NAD binding domain
essentially corresponds to this activity and is localized to the
C terminus.



Thus an essential characteristic of the PARPs according to the
invention is the presence of a functional NAD+ binding domain (PARP
signature) which is located in the C-terminal region of the amino
acid sequence (i.e. approximately in the region of the last 400,
such as, for example, the last 350 or 300, C-terminal amino acids),
in combination with an N-terminal sequence having no zinc finger
motifs. Since the zinc finger motifs in known PARPs presumably
contribute to recognition of the DNA breakages, it is to be assumed
that the proteins according to the invention do not interact with
DNA or do so in another way. It has been demonstrated by appropriate
biochemical tests that the PARP2 according to the invention can be
activated by 'activated DNA' (i.e. DNA after limited DNase I
digestion). It can be concluded from this further that the PARP2
according to the invention has DNA binding properties. However, the
mechanism of the DNA binding and enzyme activation differs between
the PARPs according to the invention and PARP1. In PARP1, DNA
binding and enzyme activation are, as mentioned, mediated by a
characteristic zinc finger motif. No such motifs are present in the
PARPs according to the invention. Presumably these properties are
mediated by positively charged amino acids in the N-terminal region
of the PARPs according to the invention. Since the 'activated DNA'
(i.e. for example DNA after limited treatment with DNase I) has a
large number of defects (single strand breaks, single strand gaps,
single-stranded overhangs, double strand breaks etc.), it is
possible that although PARP1 and the PARPs according to the
invention are activated by the same 'activated DNA', it is by a
different subpopulation of defects (e.g. single strand breaks).

The functional NAD+ binding domain (i.e. catalytic domain) binds the
substrate for poly-ADP-ribose synthesis. Consistent with known
PARPs, the sequence motif GX1X2X3GKG, in which G is glycine, K is
lysine, and X1, X2 and X3 are, independently of one another, any
amino acid, is present in particular. However, as shown,
surprisingly, by comparison of the amino acid sequences of the NAD+
binding domains of PARP molecules according to the invention with
previously disclosed human PARP1, the sequences according to the
invention differ markedly from the known sequence for the NAD+
binding domain.

A group of PARP molecules which is preferred according to the
invention has the following general sequence motif in the catalytic
domain in common:

PXn(S/T)GX3GKGIYFA, in particular
(S/T)XGLR(I/V)XPXn(S/T)GX3GKGIYFA, preferably
LLWHG(S/T)X7IL(S/T)XGLR(I/V)XPXn(S/T)GX3GKGIYFAX3SKSAXY

in which (S/T) describes the alternative occupation of this sequence
position by S or T, (I/V) describes the alternative occupation of
this sequence position by I or V, n is an integral value from 1 to
5, and the X radicals are, independently of one another, any amino
acid. The last motif is also referred to as the "PARP signature"
motif.
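The motif notation used above (fixed residues, X with an optional
repeat count, a variable such as Xn, and alternatives such as (S/T))
maps directly onto regular expressions. The following Python sketch
shows one possible translation. The function name and the way
variable ranges are passed in are hypothetical and serve only as an
illustration.

```python
import re

# One token of the motif notation: "(S/T)", "X", "X3", "Xn" or a residue letter.
_TOKEN = re.compile(r"\(([A-Z](?:/[A-Z])+)\)|X(\d+|[a-z])?|([A-Z])")


def motif_to_regex(motif, variables=None):
    """Translate a motif such as 'PXn(S/T)GX3GKGIYFA' into a regex string.

    variables maps a lowercase variable (e.g. 'n') to an inclusive
    (minimum, maximum) repeat range. Hypothetical helper for illustration.
    """
    variables = variables or {}
    parts = []
    pos = 0
    while pos < len(motif):
        match = _TOKEN.match(motif, pos)
        if match is None:
            raise ValueError(f"cannot parse motif at position {pos}: {motif[pos:]!r}")
        alternatives, count, residue = match.groups()
        if alternatives:                      # (S/T) -> [ST]
            parts.append("[" + alternatives.replace("/", "") + "]")
        elif residue:                         # fixed residue, e.g. G
            parts.append(residue)
        elif count is None:                   # single X -> any one residue
            parts.append(".")
        elif count.isdigit():                 # X3 -> exactly three residues
            parts.append(".{%s}" % count)
        else:                                 # Xn -> range taken from variables
            if count not in variables:
                raise ValueError(f"no range given for variable {count!r}")
            low, high = variables[count]
            parts.append(".{%d,%d}" % (low, high))
        pos = match.end()
    return "".join(parts)
```

With n from 1 to 5, `motif_to_regex("PXn(S/T)GX3GKGIYFA", {"n": (1, 5)})`
gives a pattern that can be passed to `re.search` to scan a protein
sequence written in the one-letter code.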

The automodification domain is preferably likewise present in the
PARPs according to the invention. It can be located, for example,
in the region from about 100 to 200 amino acids in front of the
N-terminal end of the NAD+ binding domain.

PARP homologs according to the invention may additionally comprise,
N-terminally of the NAD+ binding domain (i.e. about 30 to about 80
amino acids closer to the N terminus), a leucine zipper-like
sequence motif of the general formula

(L/V)X6LX6LX6L

in which (L/V) represents the alternative occupation of this
sequence position by L or V, and the X radicals are, independently
of one another, any amino acid. The leucine zipper motifs observed
according to the invention differ distinctly in position from those
described for PARP from Drosophila. Leucine zippers may lead to
homodimers (two PARP molecules) or heterodimers (one PARP molecule
with a binding partner differing therefrom).

The PARP homologs according to the invention preferably additionally
comprise, N-terminally of the abovementioned leucine zipper-like
sequence motif, i.e. about 10 to 250 amino acid residues closer to
the N terminus, at least one other of the following part-sequence
motifs:

LX9NX2YX2QLLX(D/E)XbWGRVG (motif 1),
AX3FXKX4KTXNXWX5FX3PXK (motif 2),
QXL(I/L)X2IX9MX10PLGKLX3QIX6L (motif 3),
FYTXIPHXFGX3PP (motif 4) and
KX3LX2LXDIEXAX2L (motif 5),

in which (D/E) describes the alternative occupation of this sequence
position by D or E, (I/L) describes the alternative occupation of
this sequence position by I or L, b is the integral value 10 or 11,
and the X radicals are, independently of one another, any amino
acid. It is most preferred for these motifs 1 to 5 all to be present
in the stated sequence, with motif 1 being closest to the
N terminus.



The abovementioned PARP signature motif is followed in the proteins
according to the invention by at least one other of the following
motifs:

GX3LXEVALG (motif 6),
GX2SX4GX3PXaLXGX2V (motif 7) and
E(Y/F)X2YX3QX4YLL (motif 8),

in which (Y/F) describes the alternative occupation of this sequence
position by Y or F, a is an integral value from 7 to 9 and X is in
each case any amino acid. It is most preferred for the three
C-terminal motifs all to be present and in the stated sequence, with
motif 8 being closest to the C terminus.

A preferred PARP structure according to the invention may be
described schematically as follows:

motifs 1 to 5/PARP signature/motifs 6 to 8 or

motifs 1 to 5/leucine zipper/PARP signature/motifs 6 to 8,

it being possible for further amino acid residues, such as, for
example, up to 40, to be arranged between the individual motifs and
for further amino acid residues, such as, for example, up to 80, to
be arranged at the N terminus and/or at the C terminus.
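The schematic arrangement can be checked programmatically once each
motif is available as a regular expression. The following Python
sketch searches for a list of motif patterns in the stated order,
allowing up to 40 residues between consecutive motifs and up to 80
residues at either terminus. The function name is hypothetical, the
patterns are supplied by the caller, and only the leftmost match per
start position is considered, so this is an illustration under those
assumptions and not an exhaustive search.

```python
import re


def find_scheme(seq, patterns, max_gap=40, max_flank=80, start=0, first=True):
    """Return the (start, end) spans of the patterns found in order, or None.

    The first motif may start at most max_flank residues from the N terminus,
    each later motif at most max_gap residues after the previous one, and the
    last motif must end at most max_flank residues before the C terminus.
    """
    if not patterns:
        # All motifs placed: check the remaining C-terminal flank.
        return [] if len(seq) - start <= max_flank else None
    regex = re.compile(patterns[0])
    limit = max_flank if first else max_gap
    pos = start
    while True:
        match = regex.search(seq, pos)
        if match is None or match.start() - start > limit:
            return None
        rest = find_scheme(seq, patterns[1:], max_gap, max_flank, match.end(), False)
        if rest is not None:
            return [match.span()] + rest
        # Backtrack: try the next possible start position for this motif.
        pos = match.start() + 1
```

The limits of 40 and 80 residues are the example values given above;
other values can be passed as `max_gap` and `max_flank`.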

PARP homologs which are particularly preferred according to the
invention are the proteins human PARP2, human PARP3, mouse PARP3 and
the functional equivalents thereof. The protein referred to as human
PARP2 comprises 570 amino acids (cf. SEQ ID NO: 2). The protein
referred to as human PARP3 possibly exists in two forms. Type 1
comprises 533 amino acids (SEQ ID NO: 4) and type 2 comprises 540
amino acids (SEQ ID NO: 6). The forms may arise through different
initiation of translation. The protein referred to as mouse PARP3
exists in two forms which differ from one another by a deletion of
5 amino acids (15 bp). Type 1 comprises 533 amino acids (SEQ ID
NO: 8) and type 2 comprises 528 amino acids (SEQ ID NO: 10).

The invention further relates to the binding partners for the PARP
homologs according to the invention. These binding partners are
preferably selected from

a) antibodies and fragments thereof, such as, for example, Fv, Fab,
   F(ab')2;

b) protein-like compounds which interact, for example via the above
   leucine zipper region or another sequence section, with PARP; and

c) low molecular weight effectors which modulate a biological PARP
   function such as, for example, the catalytic PARP activity, i.e.
   NAD+-consuming ADP ribosylation, or the binding to an activator
   protein or to DNA.

The invention further relates to nucleic acids comprising

a) a nucleotide sequence coding for at least one PARP homolog
   according to the invention, or the complementary nucleotide
   sequence thereof;

b) a nucleotide sequence which hybridizes with a sequence as
   specified in a), preferably under stringent conditions; or

c) nucleotide sequences which are derived from the nucleotide
   sequences defined in a) and b) through the degeneracy of the
   genetic code.

Nucleic acids which are suitable according to the invention 
comprise in particular at least one of the part-sequences which 
code for the abovementioned amino acid sequence motifs. 

Nucleic acids which are preferred according to the invention
comprise nucleotide sequences as shown in SEQ ID NO: 1 and 3, and,
in particular, part-sequences thereof which are characteristic of
PARP homologs according to the invention, such as, for example,
nucleotide sequences comprising

a) nucleotides +3 to +1715 shown in SEQ ID NO: 1;

b) nucleotides +242 to +1843 shown in SEQ ID NO: 3;

c) nucleotides +221 to +1843 shown in SEQ ID NO: 5;

d) nucleotides +112 to +1710 shown in SEQ ID NO: 7; or

e) nucleotides +1 to +1584 shown in SEQ ID NO: 9

or part-sequences of a), b), c), d) and e) which code for the
abovementioned characteristic amino acid sequence motifs of the
PARP homologs according to the invention.
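The nucleotide ranges a) to e) can be compared with the protein
lengths stated in the description (570, 533, 540, 533 and 528 amino
acids) by simple codon arithmetic, as in the following Python
sketch. The pairing of each nucleotide sequence SEQ ID NO: n with
the protein SEQ ID NO: n+1 is an assumption made here, as are the
names used in the code.

```python
# Nucleotide ranges a) to e): nucleotide SEQ ID NO -> (first, last) position.
CODING_RANGES = {
    1: (3, 1715),
    3: (242, 1843),
    5: (221, 1843),
    7: (112, 1710),
    9: (1, 1584),
}

# Protein lengths stated in the description. Pairing nucleotide SEQ ID NO n
# with protein SEQ ID NO n+1 is an assumption made for this illustration.
STATED_PROTEIN_LENGTHS = {1: 570, 3: 533, 5: 540, 7: 533, 9: 528}


def codon_count(first, last):
    """Number of complete codons in the inclusive range first..last."""
    length = last - first + 1
    if length % 3 != 0:
        raise ValueError(f"range {first}..{last} is not a multiple of three")
    return length // 3


def codon_surplus():
    """Codons in each range minus the stated number of amino acids."""
    return {
        seq_id: codon_count(*bounds) - STATED_PROTEIN_LENGTHS[seq_id]
        for seq_id, bounds in CODING_RANGES.items()
    }
```

Under these assumptions, every range is a whole number of codons.
The ranges a) to c) contain one codon more than the stated protein
length, which would be consistent with a range that includes the
stop codon, while d) and e) match the stated lengths exactly; this
reading is an interpretation and is not stated in the text. The
ranges d) and e) differ by 15 nucleotides, in agreement with the
deletion of 5 amino acids described for mouse PARP3.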

The invention further relates to expression cassettes which comprise
at least one of the above-described nucleotide sequences according
to the invention under the genetic control of regulatory nucleotide
sequences. These can be used to prepare recombinant vectors
according to the invention, such as, for example, viral vectors or
plasmids, which comprise at least one expression cassette according
to the invention.

Recombinant microorganisms according to the invention are
transformed with at least one of the abovementioned vectors.

The invention also relates to transgenic mammals transfected with
a vector according to the invention.

The invention further relates to an in vitro detection method,
which can be carried out homogeneously or heterogeneously, for PARP
inhibitors, which comprises

a) incubating an unsupported or supported polyADP-ribosylatable
   target with a reaction mixture comprising
   a1) a PARP homolog according to the invention;
   a2) a PARP activator; and
   a3) a PARP inhibitor or an analyte in which at least one PARP
       inhibitor is suspected;

b) carrying out the polyADP ribosylation reaction; and

c) determining the polyADP ribosylation of the target qualitatively
   or quantitatively.

The detection method is preferably carried out by preincubating the
PARP homolog with the PARP activator and the PARP inhibitor or an
analyte in which at least one PARP inhibitor is suspected, for
example for about 1-30 minutes, before carrying out the polyADP
ribosylation reaction.

After activation by DNA with single strand breaks (referred to as
"activated DNA" according to the invention), PARP polyADP-ribosylates
a large number of nuclear proteins in the presence of NAD. These
proteins include, on the one hand, PARP itself, but also histones
etc.

The polyADP-ribosylatable target preferably used in the detection
method is a histone protein in its native form or a
polyADP-ribosylatable equivalent derived therefrom. A histone
preparation supplied by Sigma (SIGMA, catalogue No. H-7755; histone
type II-AS from calf thymus, Luck, J. M., et al., J. Biol. Chem.,
233, 1407 (1958), Satake, K., et al., J. Biol. Chem., 235, 2801
(1960)) was used by way of example. It is possible in principle to
use all types of proteins or parts thereof amenable to
polyADP-ribosylation by PARP. These are preferably nuclear proteins,
e.g. histones, DNA polymerase, telomerase or PARP itself. Synthetic
peptides derived from the corresponding proteins can also act as
target.

In the ELISA assay according to the invention it is possible to use
amounts of histones in the range from about 0.1 µg/well to about
100 µg/well, preferably about 1 µg/well to about 10 µg/well. The
amounts of the PARP enzyme are in a range from about 0.2 pmol/well
to about 2 nmol/well, preferably from about 2 pmol/well to about
200 pmol/well, the reaction mixture comprising in each case
100 µl/well. Reductions to smaller wells and correspondingly smaller
reaction volumes are possible.

In the HTRF assay according to the invention, identical amounts of
PARP are employed, and the amount of histone or modified histones is
in the range from about 2 ng/well to about 25 µg/well, preferably
about 25 ng/well to about 2.5 µg/well, the reaction mixture
comprising in each case 50 µl/well. Reductions to smaller wells and
correspondingly smaller reaction volumes are possible.

The PARP activator used according to the invention is preferably
activated DNA.

Various types of damaged DNA can function as activator. DNA damage
can be produced by digestion with DNases or other DNA-modifying
enzymes (e.g. restriction endonucleases), by irradiation or other
physical methods or chemical treatment of the DNA. It is further
possible to simulate the DNA damage situation in a targeted manner
using synthetic oligonucleotides. In the assays indicated by way of
example, activated DNA from calf thymus was employed (Sigma, product
No. D4522; CAS: 91080-16-9, prepared by the method of Aposhian and
Kornberg using calf thymus DNA (SIGMA D-1501) and deoxyribonuclease
type I (D-4263); Aposhian, H. V. and Kornberg, A., J. Biol. Chem.,
237, 519 (1962)). The activated DNA was used in a concentration
range from 0.1 to 1000 µg/ml, preferably from 1 to 100 µg/ml, in the
reaction step.

The polyADP ribosylation reaction is started in the method according
to the invention by adding NAD+. The NAD concentrations were in a
range from about 0.1 µM to about 10 mM, preferably in a range from
about 10 µM to about 1 mM.

In the variant of the above method which can be carried out
heterogeneously, the polyADP ribosylation of the supported target is
determined using anti-poly(ADP-ribose) antibodies. To do this, the
reaction mixture is separated from the supported target, washed and
incubated with the antibody. This antibody can itself be labeled.
However, it is preferable to use for detecting bound
anti-poly(ADP-ribose) antibody a labeled secondary antibody or a
corresponding labeled antibody fragment. Suitable labels are, for
example, radiolabeling, chromophore- or fluorophore-labeling,
biotinylation, chemiluminescence labeling, labeling with
paramagnetic material or, in particular, enzyme labels, e.g. with
horseradish peroxidase. Appropriate detection techniques are
generally known to the skilled worker.

In the variant of the above process which can be carried out
homogeneously, the unsupported target is labeled with an acceptor
fluorophore. The target preferably used in this case is biotinylated
histone, the acceptor fluorophore being coupled via avidin or
streptavidin to the biotin groups of the histone. Particularly
suitable as acceptor fluorophore are phycobiliproteins (e.g.
phycocyanins, phycoerythrins), e.g. R-phycocyanin (R-PC),
allophycocyanin (APC), R-phycoerythrin (R-PE), C-phycocyanin (C-PC),
B-phycoerythrin (B-PE) or their combinations with one another or
with fluorescent dyes such as Cy5, Cy7 or Texas Red (tandem system)
(Thammapalerd, N. et al., Southeast Asian Journal of Tropical
Medicine & Public Health, 27(2), 297-303 (1996); Kronick, M. N. et
al., Clinical Chemistry, 29(9), 1582-1586 (1986); Hicks, J. M.,
Human Pathology, 15(2), 112-116 (1984)). The dye XL665 used in the
examples is a crosslinked allophycocyanin (Glazer, A. N., Rev.
Microbiol., 36, 173-198 (1982); Kronick, M. N., J. Imm. Meth., 92,
1-13 (1986); MacColl, R. et al., Phycobiliproteins, CRC Press, Inc.,
Boca Raton, Florida (1987); MacColl, R. et al., Arch. Biochem.
Biophys., 208(1), 42-48 (1981)).

It is additionally preferred in the homogeneous method to determine
the polyADP ribosylation of the unsupported target using
anti-poly(ADP-ribose) antibody which is labeled with a donor
fluorophore which is able to transfer energy to the acceptor
fluorophore when donor and acceptor are close in space owing to
binding of the labeled antibody to the polyADP-ribosylated histone.
A europium cryptate is preferably used as donor fluorophore for the
anti-poly(ADP-ribose) antibody.

Besides the europium cryptate used, other compounds are also
possible as potential donor molecules. This may entail, on the one
hand, modification of the cryptate cage. Replacement of the europium
by other rare earth metals such as terbium is also conceivable. It
is crucial that the fluorescence has a long duration to guarantee
the time delay (Lopez, E. et al., Clin. Chem. 39/2, 196-201 (1993);
US Patent 5,534,622).

The detection methods described above are based on the principle
that there is a correlation between the PARP activity and the amount
of ADP-ribose polymers formed on the histones. The assay described
herein makes it possible to quantify the ADP-ribose polymers using
specific antibodies in the form of an ELISA and an HTRF (homogeneous
time-resolved fluorescence) assay. Specific embodiments of these two
assays are described in detail in the following examples.

The developed HTRF (homogeneous time-resolved fluorescence) assay
system measures the formation of poly(ADP-ribose) on histones using
specific antibodies. In contrast to the ELISA, this assay is carried
out in homogeneous phase without separation and washing steps. This
makes a higher sample throughput and a smaller susceptibility to
errors possible. HTRF is based on the fluorescence resonance energy
transfer (FRET) between two fluorophores. In a FRET assay, an
excited donor fluorophore can transfer its energy to an acceptor
fluorophore when the two are close to one another in space. In HTRF
technology, the donor fluorophore is a europium cryptate [(Eu)K] and
the acceptor is XL665, a stabilized allophycocyanin. The europium
cryptate is based on studies by Jean Marie Lehn (Strasbourg) (Lopez,
E. et al., Clin. Chem. 39/2, 196-201 (1993); US Patent 5,534,622).

In a homogeneous assay, all the components are also present during
the measurement. Whereas this has advantages for carrying out the
assay (rapidity, complexity), it is necessary to preclude
interference by assay components (inherent fluorescence, quenching
by dyes etc.). HTRF precludes such interference by time-delayed
measurement at two wavelengths (665 nm, 620 nm). The HTRF
fluorescence has a very long decay time and time-delayed measurement
is therefore possible. There is no longer any interference from
short-lived background fluorescence (e.g. from assay components or
inhibitors of the substance bank). In addition, measurement is
always carried out at two wavelengths in order to compensate for
quench effects of colored substances. HTRF assays can be carried
out, for example, in 96- or 384-well microtiter plate format and are
evaluated using a Discovery HTRF microplate analyzer (Packard
Instruments).
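The two-wavelength evaluation can be illustrated by a short Python
sketch. The text specifies only that measurement takes place at
665 nm and 620 nm; forming the ratio of the two signals, scaling it
by 10 000, and normalizing against an uninhibited control and a
background control are common conventions that are assumed here and
are not taken from the description. All names in the code are
hypothetical.

```python
def htrf_ratio(em_665, em_620):
    """Ratio of acceptor (665 nm) to donor (620 nm) emission, scaled by 10 000.

    The scaling factor is a convention assumed for this illustration.
    """
    if em_620 <= 0:
        raise ValueError("donor signal at 620 nm must be positive")
    return 10_000.0 * em_665 / em_620


def percent_inhibition(sample, uninhibited, background):
    """Inhibition in percent from three ratio values.

    uninhibited: ratio of a reaction without inhibitor (assumed control)
    background:  ratio of a reaction without PARP activity (assumed control)
    """
    window = uninhibited - background
    if window == 0:
        raise ValueError("uninhibited and background ratios must differ")
    return 100.0 * (1.0 - (sample - background) / window)
```

Because a colored substance that absorbs light would be expected to
attenuate both emission signals, the ratio is less sensitive to such
quench effects than either signal alone.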

Also provided according to the invention are the following in vitro
screening methods for binding partners for PARP, in particular for
a PARP homolog according to the invention.

A first variant is carried out by

a1) immobilizing at least one PARP homolog on a support;

b1) contacting the immobilized PARP homolog with an analyte in
    which at least one binding partner is suspected; and

c1) determining, where appropriate after an incubation period,
    analyte constituents bound to the immobilized PARP homolog.

A second variant entails

a2) immobilizing on a support an analyte which comprises at least
    one possible binding partner for the PARP homolog;

b2) contacting the immobilized analyte with at least one PARP
    homolog for which a binding partner is sought; and

c2) examining the immobilized analyte, where appropriate after an
    incubation period, for binding of the PARP homolog.

The invention also relates to a method for the qualitative or
quantitative determination of a nucleic acid encoding a PARP
homolog, which comprises

a) incubating a biological sample with a defined amount of an
   exogenous nucleic acid according to the invention (e.g. with a
   length of about 20 to 500 bases or longer), hybridizing,
   preferably under stringent conditions, determining the
   hybridizing nucleic acids and, where appropriate, comparing with
   a standard; or

b) incubating a biological sample with a defined amount of
   oligonucleotide primer pairs with specificity for a PARP
   homolog-encoding nucleic acid, amplifying the nucleic acid,
   determining the amplification product and, where appropriate,
   comparing with a standard.

The invention further relates to a method for the qualitative or
quantitative determination of a PARP homolog according to the
invention, which comprises

a) incubating a biological sample with at least one binding partner
   specific for a PARP homolog,

b) detecting the binding partner/PARP complex and, where
   appropriate,

c) comparing the result with a standard.

The binding partner in this case is preferably an anti-PARP
antibody or a binding fragment thereof, which carries a detectable
label where appropriate.

The determination methods according to the invention for PARP, in
particular for PARP homologs and for the coding nucleic acid
sequences thereof, are suitable and advantageous for diagnosing
sepsis- or ischemia-related tissue damage, in particular strokes,
myocardial infarcts or septic shock.

The invention further comprises a method for determining the
efficacy of PARP effectors, which comprises

a) incubating a PARP homolog according to the invention with an
   analyte which comprises an effector of a physiological or
   pathological PARP activity; removing the effector again where
   appropriate; and

b) determining the activity of the PARP homolog, where appropriate
   after adding substrates or cosubstrates.

The invention further relates to gene therapy compositions which
comprise in a vehicle acceptable for gene therapy a nucleic acid
construct which

a) comprises an antisense nucleic acid against a coding nucleic
   acid according to the invention; or

b) comprises a ribozyme against a noncoding nucleic acid according
   to the invention; or

c) codes for a specific PARP inhibitor.

The invention further relates to pharmaceutical compositions
comprising, in a pharmaceutically acceptable vehicle, at least one
PARP protein according to the invention, at least one PARP binding
partner according to the invention or at least one coding
nucleotide sequence according to the invention.

Finally, the invention relates to the use of binding partners of a
PARP homolog for the diagnosis or therapy of pathological states in
the development and/or progress of which at least one PARP protein,
in particular a PARP homolog according to the invention, or a
polypeptide derived therefrom, is involved. The binding partner
used can be, for example, a low molecular weight binding partner
whose molecular weight can be, for example, less than about
2000 dalton or less than about 1000 dalton.

The invention additionally relates to the use of PARP binding
partners for the diagnosis or therapy of pathological states
mediated by an energy deficit. An energy deficit for the purpose of
the present invention is, in particular, a cellular energy deficit
which is to be observed in the unwell patient systemically or in
individual body regions, organs or organ regions, or tissues or
tissue regions. This is characterized by an NAD and/or ATP depletion
going beyond the physiological range of variation of the NAD and/or
ATP level and mediated preferably by a protein with PARP activity,
in particular a PARP homolog according to the invention, or a
polypeptide derived therefrom.

The invention particularly relates to the use of a PARP binding
partner as defined above for the diagnosis or therapy (acute or
prophylactic) of pathological states mediated by energy deficits
and selected from neurodegenerative disorders, or tissue damage
caused by sepsis or ischemia, in particular of neurotoxic
disturbances, strokes, myocardial infarcts, damage occurring
during or after infarct lysis (e.g. with TPA, Reteplase or
mechanically with laser or Rotablator) and of microinfarcts
during and after heart valve replacement, aneurysm resections and
heart transplants, trauma to the head and spinal cord, infarcts
of the kidney (acute kidney failure, acute renal insufficiency or
damage during and after kidney transplant), infarcts of the liver
(liver failure, damage during or after a liver transplant),
peripheral neuropathies, AIDS dementia, septic shock, diabetes,
trauma (craniocerebral trauma), massive bleeding, subarachnoid
hemorrhages, Alzheimer's disease, multi-infarct dementia,
Huntington's disease, epilepsy, Parkinson's disease, amyotrophic
lateral sclerosis, kidney failure, also in the chemotherapy of
tumors and prevention of metastasis and for the treatment of
inflammations and rheumatic disorders, e.g. of rheumatoid
arthritis; further for the treatment of revascularization of
critically narrowed coronary arteries and critically narrowed
peripheral arteries, e.g. leg arteries.

Nonlimiting examples of tumors are leukemia, glioblastomas,
lymphomas, melanomas, carcinomas of the breast and cervix etc.

The present invention will now be described in more detail with
reference to the appended figures. These show:

In Figure 1 a sequence alignment of human PARP (human PARP1) and
two PARPs preferred according to the invention (human PARP2,
human PARP3, murine PARP3). Sequence agreements between human
PARP1 and human PARP2, human PARP3 or murine PARP3 are depicted
within frames. The majority sequence is indicated over the
alignment. The zinc finger motifs of human PARP1 are located in
the sequence sections corresponding to amino acid residues
21 to 56 and 125 to 162;

In Figure 2 Northern blots with various human tissues to
illustrate the tissue distribution of PARP2 and PARP3 molecules
according to the invention. Lane 1: brain; lane 2: heart; lane 3:
skeletal muscle; lane 4: large bowel; lane 5: thymus; lane 6:
spleen; lane 7: kidney; lane 8: liver; lane 9: small bowel;
lane 10: placenta; lane 11: lung; lane 12: peripheral blood
leukocytes; the respective position of the size standard (kb) is
indicated.

In Figure 3 a Northern blot with further various human tissues to
illustrate the tissue distribution of the PARP3 molecule
according to the invention. Lane 1: heart; lane 2: brain; lane 3:
placenta; lane 4: lung; lane 5: liver; lane 6: skeletal muscle;
lane 7: kidney; lane 8: pancreas; the respective position of the
size standard (kb) is indicated.

In Figure 4 a Western blot with various human tissues to
illustrate the tissue distribution of the PARP3 molecule
according to the invention at the protein level. Lane 1: heart;
lane 2: lung; lane 3: liver; lane 4: spleen; lane 5: kidney;
lane 6: large bowel; lane 7: muscle; lane 8: brain; the
respective position of the size standard (kD) is indicated.

In Figure 5 a Western blot with various human tissues to
illustrate the tissue distribution of the PARP3 molecule
according to the invention. Lane 1: frontal cortex; lane 2:
posterior cortex; lane 3: cerebellum; lane 4: hippocampus;
lane 5: olfactory bulb; lane 6: striatum; lane 7: thalamus;
lane 8: midbrain; lane 9: entorhinal cortex; lane 10: pons;
lane 11: medulla; lane 12: spinal cord.

In Figure 6 a diagrammatic representation of the PARP assay
(ELISA)

In Figure 7 a diagrammatic representation of the PARP assay
(HTRF)

Further preferred embodiments of the invention are described in
the following sections.

PARP homologs and functional equivalents 

Unless stated otherwise, for the purposes of the present
description amino acid sequences are indicated starting with the
N terminus. If the one-letter code is used for amino acids, then
G is glycine, A is alanine, V is valine, L is leucine, I is
isoleucine, S is serine, T is threonine, D is aspartic acid, N is
asparagine, E is glutamic acid, Q is glutamine, W is tryptophan,
H is histidine, R is arginine, P is proline, K is lysine, Y is
tyrosine, F is phenylalanine, C is cysteine and M is methionine.
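
The one-letter code listed above can be written down as a simple lookup table. The following is only an illustrative sketch in Python; the names ONE_LETTER and spell_out are hypothetical helpers introduced here and are not part of the description.

```python
# One-letter amino acid code exactly as listed in the description.
ONE_LETTER = {
    "G": "glycine", "A": "alanine", "V": "valine", "L": "leucine",
    "I": "isoleucine", "S": "serine", "T": "threonine",
    "D": "aspartic acid", "N": "asparagine", "E": "glutamic acid",
    "Q": "glutamine", "W": "tryptophan", "H": "histidine",
    "R": "arginine", "P": "proline", "K": "lysine", "Y": "tyrosine",
    "F": "phenylalanine", "C": "cysteine", "M": "methionine",
}


def spell_out(sequence: str) -> list[str]:
    """Return the residue names of a one-letter sequence, N terminus first."""
    return [ONE_LETTER[residue] for residue in sequence.upper()]


if __name__ == "__main__":
    # Spell out the first three residues of an arbitrary example peptide.
    print(spell_out("GMP"))
```

A residue letter outside the twenty listed raises a KeyError, which is intended here so that unexpected characters are noticed rather than silently skipped.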

The present invention is not confined to the PARP homologs
specifically described above. On the contrary, those homologs
which are functional equivalents thereof are also embraced.
Functional equivalents comprise both natural, such as, for
example, species-specific or organ-specific, and artificially
produced variants of the proteins specifically described herein.
Functional equivalents according to the invention differ by
addition, substitution, inversion, insertion and/or deletion of
one or more amino acid residues of human PARP2 (SEQ ID NO: 2),
human PARP3 (SEQ ID NO: 4 and 6) and mouse PARP3 (SEQ ID NO: 8
and 10), there being at least retention of the NAD-binding
function of the protein mediated by a functional catalytic
C-terminal domain. Likewise, the poly(ADP-ribose)-producing
catalytic activity should preferably be retained. Functional
equivalents also comprise where appropriate those variants in
which the region similar to the leucine zipper is essentially
retained.

It is moreover possible, for example, starting from the sequence
for human PARP2 or human PARP3 to replace certain amino acids by
those with similar physicochemical properties (bulk, basicity,
hydrophobicity, etc.). It is possible, for example, for arginine
residues to be replaced by lysine residues, valine residues by
isoleucine residues or aspartic acid residues by glutamic acid
residues. However, it is also possible for one or more amino
acids to be exchanged in sequence, added or deleted, or several
of these measures can be combined together. The proteins which
have been modified in this way from the human PARP2 or human
PARP3 sequence have at least 60%, preferably at least 75%, very
particularly preferably at least 85%, homology with the starting
sequence, calculated using the algorithm of Pearson and Lipman,
Proc. Natl. Acad. Sci. (USA) 85(8), 1988, 2444-2448.

M/40043

BASF Aktiengesellschaft 990154 O.Z. 0050/49790
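
The homology thresholds above (at least 60%, 75% and 85%) can be illustrated with a small calculation. The sketch below is not the Pearson and Lipman (FastA) algorithm; it assumes two sequences that have already been aligned to equal length with "-" as the gap character, and it simply counts identical positions over the overlap. The function names and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> tuple[float, int]:
    """Return (percent identity, overlap length) for two pre-aligned sequences.

    Positions where either sequence has a gap ('-') are excluded from the
    overlap, mirroring the "number of overlapping amino acids" convention.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    overlap = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue
        overlap += 1
        if a == b:
            matches += 1
    if overlap == 0:
        raise ValueError("no overlapping positions")
    return 100.0 * matches / overlap, overlap


def homology_tier(percent: float) -> str:
    """Map a percent value onto the preference tiers named in the text."""
    if percent >= 85:
        return "at least 85% (very particularly preferred)"
    if percent >= 75:
        return "at least 75% (preferred)"
    if percent >= 60:
        return "at least 60%"
    return "below 60%"


if __name__ == "__main__":
    # Toy example: arginine/lysine and valine/isoleucine exchanges.
    identity, overlap = percent_identity("GAVLK", "GAILR")
    print(f"{identity:.1f}% over {overlap} residues: {homology_tier(identity)}")
```

Note that identity as counted here is stricter than a similarity score: the conservative exchanges mentioned in the text (arginine for lysine, valine for isoleucine) count as mismatches in this sketch.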

The following homologies have been determined at the amino acid
level and DNA level between human PARP1, 2 and 3 (FastA program,
Pearson and Lipman, loc. cit.):

Amino acid homologies:

               Percent identity    Percent identity
                                   in PARP signature
PARP1/PARP2    41.97% (517)        86% (50)
PARP1/PARP3    33.81% (565)        53.1% (49)
PARP2/PARP3    35.20% (537)        53.1% (49)

Numbers in parentheses indicate the number of overlapping amino
acids.

DNA homologies:

               Percent identity    Percent identity
               in the ORF          in PARP signature
PARP1/PARP2    60.81% (467)        77.85% (149)
PARP1/PARP3    58.81% (420)        59.02% (61)
PARP2/PARP3    60.22% (269)        86.36% (22)

Numbers in parentheses indicate the number of overlapping
nucleotides.

The polypeptides according to the invention can be classified as
homologous poly(ADP-ribose) polymerases on the basis of the great
similarity in the region of the catalytic domain.

It is also essential to the invention that the novel PARP
homologs do not have conventional zinc finger motifs. This means
that these enzymes are not necessarily involved in DNA repair, or
are so in a way which differs from PARP1, but are still able to
carry out their pathological mechanism (NAD+ consumption and thus
energy consumption due to ATP consumption). The strong protein
expression, particularly of PARP3, observable in the Western blot
suggests a significant role in the NAD consumption. This is
particularly important for drug development. Potential novel
inhibitors of the polymerases according to the invention can thus
inhibit the pathological functions without having adverse effects
on the desired physiological properties. This was impossible with
inhibitors against the PARPs known to date since there was always
also inhibition of the DNA repair function. The potentially
mutagenic effect of known PARP inhibitors is thus easy to
understand. It is also conceivable to design PARP inhibitors so
that they efficiently inhibit all PARP homologs with high
affinity. In this case, a potentiated effect is conceivable where
appropriate.

The PARP homolog which is preferred according to the invention
and is shown in SEQ ID NO: 2 (human PARP2) can advantageously be
isolated from human brain, heart, skeletal muscle, kidney and
liver. The expression of human PARP2 in other tissues or organs
is distinctly weaker.

The PARP homolog which is preferred according to the invention
and is shown in SEQ ID NO: 4 and 6 (human PARP3) can
advantageously be isolated from human brain (in this case very
specifically from the hippocampus), heart, skeletal muscle, liver
or kidney. The expression of human PARP3 in other tissues or
organs, such as muscle or liver, is distinctly weaker.

The skilled worker familiar with protein isolation will make use
of the combination of preparative methodologies which is most
suitable in each case for isolating natural PARPs according to
the invention from tissues or recombinantly prepared PARPs
according to the invention from cell cultures. Suitable standard
preparative methods are described, for example, in Cooper, T.G.,
Biochemische Arbeitsmethoden, published by Walter de Gruyter,
Berlin, New York or in Scopes, R., Protein Purification, Springer
Verlag, New York, Heidelberg, Berlin.

The invention additionally relates to PARP2 and PARP3 homologs
which, although they can be isolated from other eukaryotic
species, i.e. invertebrates or vertebrates, especially other
mammals such as, for example, mice, rats, cats, dogs, pigs,
sheep, cattle, horses or monkeys, or from other organs such as,
for example, the myocardium, have the essential structural and
functional properties predetermined by the PARPs according to the
invention.

In particular, the human PARP2 which can be isolated from human
brain, and its functional equivalents, are preferred agents for
developing inhibitors of stroke. This is because it can be
assumed that drug development based on PARP2 as indicator makes
it possible to develop inhibitors which are optimized for use in
the human brain. However, it cannot be ruled out that inhibitors
developed on the basis of PARP2 can also be employed for treating
PARP-mediated pathological states in other organs too. On the
basis of the tissue distribution of the proteins according to the
invention, indications of particular interest are those derived
from ischemic states of appropriate organs (ischemia of the brain
(stroke), of the heart (myocardial infarct), damage occurring
during or after infarct lysis (e.g. with TPA, Reteplase or
mechanically with laser or Rotablator) and microinfarcts during
and after heart valve replacement, aneurysm resections and heart
transplants, of the kidney (acute kidney failure, acute renal
insufficiency or damage during and after a kidney transplant),
damage to the liver or the skeletal muscle). Also conceivable are
the treatment and prophylaxis of neurodegenerative disorders
occurring after ischemia, trauma (craniocerebral trauma), massive
bleeding, subarachnoid hemorrhages and stroke, and of
neurodegenerative disorders such as multi-infarct dementia,
Alzheimer's disease, Huntington's disease and epilepsies,
especially of generalized epileptic seizures such as petit mal
and tonic-clonic seizures and partial epileptic seizures, such as
temporal lobe, and complex partial seizures. Said proteins may
also be relevant for the treatment of revascularization of
critically narrowed coronary arteries and critically narrowed
peripheral arteries, e.g. leg arteries. Said proteins may
additionally play a part in the chemotherapy of tumors and in the
prevention of metastasis, and in the treatment of inflammations
and rheumatic disorders, e.g. of rheumatoid arthritis. Further
pathological states of these and other organs are conceivable.

PARP2 and 3 are also, similar to PARP1, activated by damaged DNA,
although by a presumably different mechanism. Significance in DNA
repair is conceivable. Blockade of the PARPs according to the
invention would also be beneficial in indications such as cancer
(e.g. in the radiosensitization of tumor patients).

Another essential biological property of PARPs according to the
invention and their functional equivalents is to be seen in their
ability to bind an interacting partner. Human PARP2 and 3 differ
from previously disclosed PARPs from higher eukaryotes such as,
in particular, mammals by having potential so-called leucine
zipper motifs. This is a typical motif for protein-protein
interactions. It is possible that these motifs permit modulation
of PARP activity by an interacting partner. This additional
structural element thus also provides a possible starting point
for development of PARP effectors such as, for example,
inhibitors.

The invention thus further relates to proteins which interact
with PARP2 and/or 3, preferably those which bring about their
activation or inactivation.

The invention further relates to proteins which still have the
abovementioned ligand-binding activity and which can be prepared
starting from the specifically disclosed amino acid sequences by
targeted modifications.

It is possible, starting from the peptide sequence of the
proteins according to the invention, to generate synthetic
peptides which are employed, singly or in combination, as
antigens for producing polyclonal or monoclonal antibodies. It is
also possible to employ the PARP protein or fragments thereof for
generating antibodies. The invention thus also relates to peptide
fragments of PARP proteins according to the invention which
comprise characteristic part-sequences, in particular those
oligo- or polypeptides which comprise at least one of the
abovementioned sequence motifs. Fragments of this type can be
obtained, for example, by proteolytic digestion of PARP proteins
or by chemical synthesis of peptides.

Novel specific PARP binding partners 

Active and selective inhibitors against the proteins according to
the invention were developed using the specific assay systems
described above for binding partners for PARP1, PARP2 and PARP3.

Inhibitors provided according to the invention have a strong
inhibitory activity on PARP2. The Ki values may in this case be
less than about 100 nM, such as less than about 700 nM, less than
about 100 nM and less than about 30 nM, e.g. about 1 to 20 nM.

Inhibitors preferred according to the invention have a
surprisingly marked selectivity for PARP2. This is shown by the
Ki(PARP1) : Ki(PARP2) ratio for inhibitors according to the
invention which is, for example, greater than 5, preferably
greater than 10 and, in particular, greater than 20 and is, for
example, in the range of about 30-100, e.g. about 40-80. Another
group of inhibitors was developed so that they inhibit PARP1 and
PARP2 simultaneously.

An example which should be mentioned is
2-(4-(2-(N,N-diethylamino)eth-1-yloxy)phenyl)benzimidazole-4-carboxamide.
This compound showed selectivity for PARP2 (Ki = 7 nM) relative
to PARP1 (Ki = 200 nM).
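
The selectivity criterion can be checked numerically for the example compound. The sketch below merely divides the two Ki values quoted above and compares the result with the stated thresholds (greater than 5, 10 and 20); the function names are hypothetical and chosen only for this illustration.

```python
def selectivity_ratio(ki_parp1_nM: float, ki_parp2_nM: float) -> float:
    """Ki(PARP1) : Ki(PARP2); larger values mean greater PARP2 selectivity."""
    if ki_parp1_nM <= 0 or ki_parp2_nM <= 0:
        raise ValueError("Ki values must be positive")
    return ki_parp1_nM / ki_parp2_nM


def selectivity_tier(ratio: float) -> str:
    """Classify a ratio by the thresholds named in the description."""
    if ratio > 20:
        return "greater than 20"
    if ratio > 10:
        return "greater than 10"
    if ratio > 5:
        return "greater than 5"
    return "not selective by the stated thresholds"


if __name__ == "__main__":
    # Ki values quoted for the benzimidazole-4-carboxamide example.
    ratio = selectivity_ratio(ki_parp1_nM=200.0, ki_parp2_nM=7.0)
    print(f"Ki(PARP1):Ki(PARP2) = {ratio:.1f} ({selectivity_tier(ratio)})")
```

With the quoted values the ratio is 200/7, i.e. roughly 29, which satisfies the "greater than 20" criterion.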

Nucleic acids coding for PARP homologs:

Unless stated otherwise, nucleotide sequences are indicated in
the present description from the 5' to the 3' direction.

The invention further relates to nucleic acid sequences which
code for the abovementioned proteins, in particular for those
having the amino acid sequence depicted in SEQ ID NO: 2, 4, 6, 8
and 10, but without being restricted thereto. Nucleic acid
sequences which can be used according to the invention also
comprise allelic variants which, as described above for the amino
acid sequences, are obtainable by deletion, inversion, insertion,
addition and/or substitution of nucleotides, preferably of
nucleotides shown in SEQ ID NO: 1, 3, 7 and 9, but with essential
retention of the biological properties and the biological
activity of the corresponding gene product. Nucleotide sequences
which can be used are obtained, for example, by silent (without
alteration of the amino acid sequence) or conservative (exchange
of amino acids of the same size, charge, polarity or solubility)
nucleotide substitutions.

Nucleic acid sequences according to the invention also embrace
functional equivalents of the genes, such as eukaryotic homologs
for example from invertebrates such as Caenorhabditis or
Drosophila, or vertebrates, preferably from the mammals described
above. Preferred genes are those from vertebrates which code for
a gene product which has the properties essential to the
invention as described above.

The nucleic acids according to the invention can be obtained in a
conventional way by various routes:

For example, a genomic or a cDNA library can be screened for DNA
which codes for a PARP molecule or a part thereof. For example, a
DNA library obtained from human brain, heart or kidney can be
screened with a suitable probe such as, for example, a labeled
single-stranded DNA fragment which corresponds to a part-sequence
of suitable length selected from SEQ ID NO: 1 or 3, or a sequence
complementary thereto. For this purpose, it is possible, for
example, for the DNA fragments of the library which have been
transferred into a suitable cloning vector to be, after
transformation into a bacterium, plated out on agar plates. The
clones can then be transferred to nitrocellulose filters and,
after denaturation of the DNA, hybridized with the labeled probe.
Positive clones are then isolated and characterized.

The DNA coding for PARP homologs according to the invention or
partial fragments can also be synthesized chemically starting
from the sequence information contained in the present
application. For example, it is possible for this purpose for
oligonucleotides with a length of about 100 bases to be
synthesized and sequentially ligated in a manner known per se by,
for example, providing suitable terminal restriction cleavage
sites.
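
As a rough illustration of the synthesis route just described, a coding sequence can be partitioned into consecutive pieces of about 100 bases. The sketch below only performs the partitioning; it does not design the terminal restriction cleavage sites needed for ligation, and the function name and the toy sequence are hypothetical.

```python
def split_into_oligos(sequence: str, oligo_length: int = 100) -> list[str]:
    """Split a sequence (5' to 3') into consecutive oligonucleotides.

    The last piece may be shorter than oligo_length.
    """
    if oligo_length <= 0:
        raise ValueError("oligo_length must be positive")
    return [
        sequence[start:start + oligo_length]
        for start in range(0, len(sequence), oligo_length)
    ]


if __name__ == "__main__":
    # Toy 240-nt sequence standing in for part of a coding sequence.
    toy_sequence = "ACGT" * 60
    for index, oligo in enumerate(split_into_oligos(toy_sequence), start=1):
        print(f"oligo {index}: {len(oligo)} bases")
```

Joining the pieces in order reproduces the input sequence, which is a quick sanity check before any junction design is attempted.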

The nucleotide sequences according to the invention can also be
prepared with the aid of the polymerase chain reaction (PCR). For
this, a target DNA such as, for example, DNA from a suitable
full-length clone is hybridized with a pair of synthetic
oligonucleotide primers which have a length of about 15 bases and
which bind to opposite ends of the target DNA. The sequence
section lying between them is then filled in with DNA polymerase.
Repetition of this cycle many times allows the target DNA to be
amplified (cf. White et al. (1989), Trends Genet. 5, 185).
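
The primer geometry described above can be sketched as an "in silico PCR": the forward primer matches the top strand directly, while the reverse primer binds the opposite strand, so its reverse complement is searched for on the top strand. This is a minimal sketch assuming exact primer matches and a single binding site each; the template and primers are made-up toy sequences, not sequences from the sequence listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def in_silico_pcr(template: str, forward_primer: str, reverse_primer: str):
    """Return the amplified section of the template, or None if a primer
    does not bind. All sequences are written 5' to 3'."""
    template = template.upper()
    forward = forward_primer.upper()
    start = template.find(forward)
    if start == -1:
        return None
    # The reverse primer anneals to the bottom strand; on the top strand
    # its binding site appears as the primer's reverse complement.
    site = reverse_complement(reverse_primer)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]


if __name__ == "__main__":
    forward = "ACGTACGTACGTACG"                        # 15-base toy primer
    reverse = reverse_complement("CATCATCATCATCAT")    # 15-base toy primer
    template = "TTTT" + forward + "GGGCCCAAATTT" + "CATCATCATCATCAT" + "AAAA"
    print(in_silico_pcr(template, forward, reverse))
```

The returned product includes both primer binding sites, since in a real reaction the primers become part of the amplified fragment.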

The nucleic acid sequences according to the invention are also to
be understood to include truncated sequences, single-stranded DNA
or RNA of the coding and noncoding, complementary DNA sequence,
mRNA sequences and cDNAs derived therefrom.

The invention further embraces nucleotide sequences hybridizing
with the above sequences under stringent conditions. Stringent
hybridization conditions for the purpose of the present invention
exist when the hybridizing sequences have a homology of about 70
to 100%, such as, for example, about 80 to 100% or 90 to 100%
(preferably in an amino acid section of at least about 40, such
as, for example, about 50, 100, 150, 200, 400 or 500 amino
acids).

Stringent conditions for the screening of DNA, in particular cDNA
banks, exist, for example, when the hybridization mixture is
washed with 0.1X SSC buffer (20X SSC buffer = 3M NaCl, 0.3M
sodium citrate, pH 7.0) and 0.1% SDS at a temperature of about
60°C.

Northern blot analyses are washed under stringent conditions with
0.1X SSC, 0.1% SDS at a temperature of about 65°C, for example.
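
The wash buffer concentrations follow directly from the stock definition given above (20X SSC = 3 M NaCl, 0.3 M sodium citrate). The short sketch below works out the salt concentrations of a diluted SSC buffer and the stock volume required; the function names and the 1 litre batch size are assumptions made only for the example.

```python
# Composition of 20X SSC as defined in the description (mol/l).
STOCK_FOLD = 20.0
STOCK_20X_SSC_MOLAR = {"NaCl": 3.0, "sodium citrate": 0.3}


def ssc_concentrations_mM(fold: float) -> dict[str, float]:
    """Salt concentrations (mmol/l) of an SSC buffer at the given strength."""
    return {
        salt: molar * fold / STOCK_FOLD * 1000.0
        for salt, molar in STOCK_20X_SSC_MOLAR.items()
    }


def stock_volume_ml(target_fold: float, final_volume_ml: float) -> float:
    """Volume of 20X stock needed for a diluted buffer of final_volume_ml."""
    return final_volume_ml * target_fold / STOCK_FOLD


if __name__ == "__main__":
    for salt, millimolar in ssc_concentrations_mM(0.1).items():
        print(f"0.1X SSC: {salt} about {millimolar:.1f} mM")
    print(f"20X stock for 1 l of 0.1X SSC: about {stock_volume_ml(0.1, 1000.0):.1f} ml")
```

A 0.1X wash buffer is a 200-fold dilution of the stock, i.e. about 15 mM NaCl and about 1.5 mM sodium citrate; the low salt concentration is what makes the wash stringent.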






Nucleic acid derivatives and expression constructs: 

The nucleic acid sequences are also to be understood to include
derivatives such as, for example, promoter variants or
alternative splicing variants. The promoters operatively linked
in front of the nucleotide sequences according to the invention
may moreover be modified by nucleotide addition(s) or
substitution(s), inversion(s), insertion(s) and/or deletion(s),
but without impairing the functionality or activity of the
promoters. The promoters can also have their activity increased
by modifying their sequence, or be completely replaced by more
effective promoters even from heterologous organisms. The
promoter variants described above are used to prepare expression
cassettes according to the invention.

Specific examples of human PARP2 splicing variants which may be
mentioned are:

Variant human PARP2a: Deletion of base pairs 766 to 904 (cf. SEQ
ID NO: 1). This leads to a frame shift with a new stop codon
("TAA" corresponding to nucleotides 922 to 924 in SEQ ID NO: 1).

Variant human PARP2b: Insertion of
5'- gta tgc cag gaa ggt cat ggg cca gca aaa ggg tct ctg -3'
after nucleotide 204 (SEQ ID NO: 1). This extends the amino acid
sequence by the insertion: GMPGRSWASKRVS
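
Both variants can be checked with a few lines of code. The sketch below uses the standard genetic code and two assumptions that are not stated in the description: the reading frame of the PARP2b insertion is taken to start at its third nucleotide (offset 2), chosen because this reproduces the stated peptide, and the leading glycine of GMPGRSWASKRVS is assumed to come from a codon that spans the upstream junction with SEQ ID NO: 1, which is not reproduced here. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons of a DNA sequence starting at offset."""
    dna = dna.upper().replace(" ", "")
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(offset, len(dna) - 2, 3)
    )


def deleted_length(first: int, last: int) -> int:
    """Number of positions removed by deleting first..last inclusive."""
    return last - first + 1


def causes_frameshift(first: int, last: int) -> bool:
    """A deletion shifts the frame if its length is not a multiple of 3."""
    return deleted_length(first, last) % 3 != 0


if __name__ == "__main__":
    parp2b_insert = "gta tgc cag gaa ggt cat ggg cca gca aaa ggg tct ctg"
    # Assumed frame: the first two nucleotides complete an upstream codon.
    print(translate(parp2b_insert, offset=2))
    print(deleted_length(766, 904), causes_frameshift(766, 904))
```

In the assumed frame the complete codons of the insertion read MPGRSWASKRVS, which agrees with the stated insertion apart from the junction-spanning first residue. For PARP2a, the deletion of positions 766 to 904 removes 139 base pairs, which is not a multiple of three and is therefore consistent with the frame shift described.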

Nucleic acid derivatives also mean variants whose nucleotide
sequence in the region from -1 to -1000 in front of the start
codon has been modified so that gene expression and/or protein
expression is increased.

Besides the nucleotide sequence described above, the nucleic acid
constructs which can be used according to the invention comprise
in functional, operative linkage one or more other regulatory
sequences, such as promoters, amplification signals, enhancers,
polyadenylation sequences, origins of replication, reporter
genes, selectable marker genes and the like. This linkage may,
depending on the desired use, lead to an increase or decrease in
gene expression.

In addition to the novel regulatory sequences, it is possible for
the natural regulatory sequence still to be present in front of
the actual structural genes. This natural regulation can, where
appropriate, be switched off by genetic modification, and the
expression of the genes increased or decreased. However, the gene
construct may also have a simpler structure, that is to say no
additional regulatory signals are inserted in front of the
structural genes, and the natural promoter with its regulation is
not deleted. Instead, the natural regulatory sequence is mutated
in such a way that regulation no longer takes place, and gene
expression is enhanced or diminished. It is also possible to
insert additional advantageous regulatory elements at the 3' end
of the nucleic acid sequences. The nucleic acid sequences can be
present in one or more copies in the gene construct.

Advantageous regulatory sequences for the expression method
according to the invention are, for example, present in promoters
such as cos, tac, trp, tet, trp-tet, lpp, lac, lpp-lac, lacIq,
T7, T5, T3, gal, trc, ara, SP6, λ-PR or the λ-PL promoter, which
are advantageously used in Gram-negative bacteria. Other
advantageous regulatory sequences are present, for example, in
the Gram-positive promoters amy and SPO2, in the yeast promoters
ADC1, MFα, AC, P-60, CYC1, GAPDH or in the plant promoters
CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33, nos or in the
ubiquitin or phaseolin promoter.

It is possible in principle to use all natural promoters with
their regulatory sequences. It is also possible and advantageous
to use synthetic promoters.

Said regulatory sequences are intended to make specific
expression of the nucleic acid sequences and protein expression
possible. This may mean, for example, depending on the host
organism, that the gene is expressed or overexpressed only after
induction, or that it is immediately expressed and/or
overexpressed.

The regulatory sequences or factors may moreover preferably have
a positive influence on, and thus increase or decrease, the
expression. Thus, enhancement of the regulatory elements may
advantageously take place at the level of transcription by using
strong transcription signals such as promoters and/or enhancers.
However, it is also possible to enhance translation by, for
example, improving the stability of the mRNA.

Enhancers mean, for example, DNA sequences which bring about
increased expression via an improved interaction between RNA
polymerase and DNA.

The recombinant nucleic acid construct or gene construct is, for
expression in a suitable host organism, advantageously inserted
into a host-specific vector which makes optimal expression of the
genes in the host possible. Vectors are well known to the skilled
worker and are to be found, for example, in "Cloning Vectors"
(Pouwels P. H. et al., Ed., Elsevier, Amsterdam-New York-Oxford,
1985). Apart from plasmids, vectors also mean all other vectors
known to the skilled worker, such as, for example, phages,
viruses, such as SV40, CMV, baculovirus and adenovirus,
transposons, IS elements, phasmids, cosmids, and linear or
circular DNA. These vectors may undergo autonomous replication in
the host organism or chromosomal replication.

Expression of the constructs: 

The recombinant constructs according to the invention described
above are advantageously introduced into a suitable host system
and are expressed. Cloning and transfection methods familiar to
the skilled worker are preferably used in order to bring about
expression of said nucleic acids in the particular expression
system. Suitable systems are described, for example, in Current
Protocols in Molecular Biology, F. Ausubel et al., ed., Wiley
Interscience, New York 1997.

Suitable host organisms are in principle all organisms which make
it possible to express the nucleic acids according to the
invention, their allelic variants, their functional equivalents
or derivatives or the recombinant nucleic acid construct. Host
organisms mean, for example, bacteria, fungi, yeasts, plant or
animal cells. Preferred organisms are bacteria such as those of
the genera Escherichia, such as, for example, Escherichia coli,
Streptomyces, Bacillus or Pseudomonas, eukaryotic microorganisms
such as Saccharomyces cerevisiae, Aspergillus, higher eukaryotic
cells from animals or plants, for example Sf9 or CHO cells.

The gene product can also, if required, be expressed in
transgenic organisms such as transgenic animals such as, in
particular, mice, sheep, or transgenic plants. The transgenic
organisms may also be so-called knock-out animals or plants in
which the corresponding endogenous gene has been switched off,
such as, for example, by mutation or partial or complete
deletion.

The combination of the host organisms and the vectors appropriate
for the organisms, such as plasmids, viruses or phages, such as,
for example, plasmids with the RNA polymerase/promoter system,
phages λ, µ or other temperate phages or transposons and/or other
advantageous regulatory sequences forms an expression system. The
term expression systems preferably means, for example, a
combination of mammalian cells such as CHO cells, and vectors,
such as pcDNA3neo vector, which are suitable for mammalian cells.

As described above, the gene product can also be expressed
advantageously in transgenic animals, e.g. mice, sheep, or
transgenic plants. It is likewise possible to program cell-free
translation systems with the RNA derived from the nucleic acid.

The gene product can also be expressed in the form of
therapeutically or diagnostically suitable fragments. To isolate
the recombinant protein it is possible and advantageous to use
vector systems or oligonucleotides which extend the cDNA by
certain nucleotide sequences and thus code for modified
polypeptides which serve to simplify purification. Suitable
modifications of this type are, for example, so-called tags which
act as anchors, such as, for example, the modification known as
the hexa-histidine anchor, or epitopes which can be recognized as
antigens by antibodies (described, for example, in Harlow, E. and
Lane, D., 1988, Antibodies: A Laboratory Manual. Cold Spring
Harbor (N.Y.) Press). These anchors can be used to attach the
proteins to a solid support such as, for example, a polymer
matrix, which can, for example, be packed into a chromatography
column, or to a microtiter plate or to another support.

These anchors can also at the same time be used to recognize the
proteins. It is also possible to use for recognition of the
proteins conventional markers such as fluorescent dyes, enzyme
markers which form a detectable reaction product after reaction
with a substrate, or radioactive markers, alone or in combination
with the anchors for derivatizing the proteins.

Production of antibodies: 

Anti-PARP2 antibodies are produced in a manner familiar to the
skilled worker. Antibodies mean both polyclonal, monoclonal,
human or humanized antibodies or fragments thereof, single chain
antibodies or else synthetic antibodies, likewise antibody
fragments such as Fv, Fab and F(ab')2. Suitable production
methods are described, for example, in Campbell, A.M., Monoclonal
Antibody Technology, (1987) Elsevier Verlag, Amsterdam, New York,
Oxford and in Breitling, F. and Dübel, S., Rekombinante
Antikörper (1997), Spektrum Akademischer Verlag, Heidelberg.

Further use of the coding sequence:

The present cDNA additionally provides the basis for cloning the
genomic sequence of the novel PARP gene. This also includes the
relevant regulatory or promoter sequence, which is available, for
example, by sequencing the region located 5' upstream of the cDNA
according to the invention. The cDNA sequence information is also
the basis for producing antisense molecules or ribozymes with the
aid of known methods (cf. Jones, J.T. and Sullenger, B.A. (1997)
Nat. Biotechnol. 15, 902; Nellen, W. and Lichtenstein, C. (1993)
TIBS, 18, 419). The genomic DNA can likewise be used to produce
the gene constructs described above.

Another possibility of using the nucleotide sequence or parts
thereof is to generate transgenic animals. Transgenic
overexpression or genetic knock-out of the sequence information
in suitable animal models may provide further valuable
information about the (patho)physiology of the novel enzymes.

Therapeutic applications:

In situations where there is a prevailing deficiency of a protein
according to the invention it is possible to employ several
methods for replacement. On the one hand, the protein, natural or
recombinant, can be administered directly or by gene therapy in
the form of its coding nucleic acid (DNA or RNA). It is possible
to use any suitable vectors for this, for example both viral and
non-viral vehicles. Suitable methods are described, for example,
by Strauss and Barranger in Concepts in Gene Therapy (1997),
Walter de Gruyter, publisher. Another alternative is provided by
stimulation of the endogenous gene by suitable agents.

It is also possible to block the turnover or the inactivation of
PARPs according to the invention, for example by proteases.
Finally, inhibitors or agonists of PARPs according to the
invention can be employed.

In situations where a PARP is present in excess or is
overactivated, various types of inhibitors can be employed. This
inhibition can be achieved both by antisense molecules,
ribozymes, oligonucleotides or antibodies, and by low molecular
weight compounds.

Nontherapeutic applications:

The nucleic acids according to the invention, such as, for
example, cDNA, the genomic DNA, the promoter, and the
polypeptide, and partial fragments thereof, can also be used in
recombinant or nonrecombinant form for developing various test
systems.

For example, it is possible to establish a test system which is
suitable for measuring the activity of the promoter or of the
protein in the presence of a test substance. The methods of
measurement in this case are preferably simple ones, e.g.
colorimetric, luminometric, fluorimetric, immunological or
radioactive, and preferably allow a large number of test
substances to be measured rapidly. Tests of this type are
suitable and advantageous for so-called high-throughput
screening. These test systems allow test substances to be
assessed for their binding to or their agonism, antagonism or
inhibition of proteins according to the invention.

Determination of the amount, activity and distribution of the
proteins according to the invention or their underlying mRNA in
the human body can be used for the diagnosis, for the
determination of the predisposition and for the monitoring of
certain diseases. Likewise, the sequence of the cDNA and the
genomic sequence may provide information about genetic causes
of and predispositions to certain diseases. It is possible to use
for this purpose both DNA/RNA probes and antibodies of a wide
variety of types. The nucleotide sequences according to the
invention or parts thereof can further be used in the form of
suitable probes for detecting point mutations, deletions or
insertions.

The proteins according to the invention can further be used to
identify and isolate their natural ligands or interacting
partners. The proteins according to the invention can
additionally be used to identify and isolate artificial or
synthetic ligands. For this purpose, the recombinantly prepared
or purified natural protein can be derivatized in such a way that
it has modifications which permit linkage to support materials.
Proteins bound in this way can be incubated with various
analytes, such as, for example, protein extracts or peptide
libraries or other sources of ligands. Specifically bound
peptides, proteins or low molecular weight, non-proteinogenous
substances can be isolated and characterized in this way.
Non-proteinogenous substances mean, for example, low molecular
weight chemical substances which may originate, for example, from
classical drug synthesis or from so-called substance libraries
which have been synthesized combinatorially.

The protein extracts used are derived, for example, from
homogenates of plants or parts of plants, microorganisms, human
or animal tissues or organs.

Ligands or interacting partners can also be identified by methods
like the yeast two-hybrid system (Fields, S. and Song, O. (1989)
Nature, 340, 245). The expression banks which can be employed in
this case may be derived, for example, from human tissues such
as brain, heart, kidney etc.

The nucleic acid sequences according to the invention and the
proteins encoded by them can be employed for developing reagents,
agonists and antagonists or inhibitors for the diagnosis and
therapy of chronic and acute diseases associated with the
expression of one of the protein sequences according to the
invention, such as, for example, with increased or decreased
expression thereof. The reagents, agonists, antagonists or
inhibitors developed can subsequently be used to produce
pharmaceutical preparations for the treatment or diagnosis of
disorders. Examples of possible diseases in this connection are
those of the brain, of the peripheral nervous system, of the
cardiovascular system or of the eye, and septic shock,
rheumatoid arthritis, diabetes, acute kidney failure and
cancer.

The relevance of the proteins according to the invention for said
indications was verified using specific inhibitors in relevant
animal models (see examples). In a model of neurodegenerative
disorders (NMDA excitotoxicity), the specific PARP2 inhibitor
2-(4-(2-(N,N-diethylamino)eth-1-yloxy)phenyl)-
benzimidazole-4-carboxamide had surprisingly good activity
(ED50 < 100 mg/kg).

The invention is now illustrated in detail with reference to the
following examples.

Example 1: Isolation of the PARP2 and PARP3 cDNA

The present cDNA sequences were found for the first time on
sequence analysis of cDNA clones of a cDNA library from human
brain (Human Brain 5'Stretch Plus cDNA Library, # HL3002a,
Clontech). The mouse PARP3 clones were isolated from a "λTriplEx
mouse brain cDNA library" (Clontech order No. ML5004t). The
sequences of these clones are described in SEQ ID NO:1, 3, 7 and
9.

Example 2: Expression of PARP2 and PARP3 in human tissues

The expression of human PARP2 and human PARP3 was investigated in
eight different human tissues by Northern blot analysis. A Human
Multiple Tissue Northern Blot (MTN™) supplied by Clontech
(#7760-1 and #7780-1) was hybridized for this purpose with an RNA
probe. The probe was produced by in vitro transcription of the
corresponding cDNA of human PARP2 and human PARP3 in the presence
of digoxigenin-labeled nucleotides in accordance with the
manufacturer's method (BOEHRINGER MANNHEIM DIG Easy Hyb order No.
1603 558, DIG Easy Hyb method for RNA:RNA hybridization). The
protocol was modified for the prehybridization: 2 x 1 h
with addition of herring sperm DNA (10 mg/ml of hybridization
solution). Hybridization then took place overnight with addition
of herring sperm DNA (10 mg/ml of hybridization solution). The
bands were detected using the CDP-Star protocol (BOEHRINGER
MANNHEIM CDP-Star™ order No. 1685 627).

After stringent washing, the transcript of PARP2 was mainly
detected in human brain, heart, skeletal muscle, kidney and
liver. The transcript size of about 1.9 kb corresponds to the
length of the cDNA determined (1.85 kb) (cf. Figure 2(A)).

In other tissues or organs, human PARP2 expression is
considerably weaker.

After stringent washing, the transcript of PARP3 was mainly
detected in heart, brain, kidney, skeletal muscle and liver.
Expression in other tissues (placenta, lung, pancreas) is
distinctly weaker (cf. Figure 2(B)). There are at least 2
transcripts for human PARP3, which can presumably be explained by
different polyadenylation sites or alternative splicing. Their
size (about 2.2 kb and 2.5 kb respectively) corresponds to the
length of the cDNA determined (2.3 kb). Washing was carried out
with 0.2 x SSC/0.2% SDS at room temperature for 2 x 15 minutes and
then with 0.1 x SSC/0.1% SDS at 65°C for 2 x 15 minutes (prepared
from 20x SSC: 3 M NaCl, 0.3 M sodium citrate, pH 7.0).
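The wash solutions are dilutions of the 20x SSC stock, so the volumes follow from C1*V1 = C2*V2. The following is a minimal sketch of that arithmetic only. It reads the first wash as 0.2 x SSC, assumes a batch size of 500 ml (not stated in the text), and ignores the SDS addition; the function name is illustrative.

```python
def dilute_stock(stock_x, target_x, final_volume_ml):
    """Return (ml of stock, ml of water) for a C1*V1 = C2*V2 dilution."""
    if not 0 < target_x <= stock_x:
        raise ValueError("target strength must be positive and not exceed the stock")
    stock_ml = final_volume_ml * target_x / stock_x
    return stock_ml, final_volume_ml - stock_ml

STOCK_X = 20            # 20x SSC as defined in the text
NACL_M_AT_20X = 3.0     # 3 M NaCl in the 20x stock
BATCH_ML = 500.0        # assumed batch size, not from the text

for label, target_x in (("room-temperature wash", 0.2), ("65 degree wash", 0.1)):
    stock_ml, water_ml = dilute_stock(STOCK_X, target_x, BATCH_ML)
    nacl_mM = NACL_M_AT_20X * 1000 * target_x / STOCK_X
    print(f"{label}: {stock_ml:.1f} ml 20x SSC + {water_ml:.1f} ml water, "
          f"NaCl about {nacl_mM:.0f} mM")
```

The same helper can be reused for any other strength, provided the target does not exceed the stock.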

Example 3: Production of antibodies

Specific antibodies against the proteins according to the
invention were produced. These were used inter alia for analyzing
the tissue distribution at the protein level of PARP2 and PARP3 by
immunoblot (Western blot) analysis. Examples of the production of
such antibodies are indicated below.

The following peptides were prepared for antibody production by
synthesis in the manner familiar to the skilled worker. In
some cases, a cysteine residue was attached to the N or C
terminus of the sequence in order to facilitate coupling to KLH
(keyhole limpet hemocyanin).

PARP-2: NH2-MAARRRRSTGGGRARALNES-CO2H (amino acids 1-20)
        NH2-KTELQSPEHPLDQHYRNLHC-CO2H (amino acids 335-353)
PARP-3: NH2-CKGRQAGREEDPFRSTAEALK-CO2H (amino acids 25-44)
        NH2-CKQQIARGFEALEALEEALK-CO2H (amino acids 230-248)

The production of an anti-PARP3 antibody is described as a
representative example.

For human PARP3, polyclonal antibodies were raised in rabbits
using a synthetic peptide having the peptide sequence
H2N-KQQIARGFEALEALEEALK-CO2H (amino acids 230-248 of the human
PARP3 protein sequence). The corresponding mouse sequence differs
in this region only by one amino acid
(H2N-KQQIARGFEALEALEEAMK-CO2H). A cysteine was also attached to
the N terminus in order to make it possible for the peptide to
couple to KLH.
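The stated residue ranges can be checked against the peptide lengths, which also shows which peptides carry the additional coupling cysteine. The sketch below is illustrative only. It assumes that a peptide exactly one residue longer than its range, with Cys at one end, carries the added cysteine at that end; the function names are not taken from the text.

```python
# Peptides as listed above: (name, sequence, first residue, last residue).
PEPTIDES = [
    ("PARP-2 N-terminal", "MAARRRRSTGGGRARALNES", 1, 20),
    ("PARP-2 internal", "KTELQSPEHPLDQHYRNLHC", 335, 353),
    ("PARP-3 N-terminal", "CKGRQAGREEDPFRSTAEALK", 25, 44),
    ("PARP-3 internal", "CKQQIARGFEALEALEEALK", 230, 248),
]

def coupling_cysteine(sequence, first, last):
    """Return 'none', 'N' or 'C' for the terminus carrying an extra cysteine."""
    span = last - first + 1
    if len(sequence) == span:
        return "none"
    if len(sequence) == span + 1 and sequence[0] == "C":
        return "N"
    if len(sequence) == span + 1 and sequence[-1] == "C":
        return "C"
    raise ValueError("sequence length does not match the stated residue range")

def mismatches(a, b):
    """1-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]

for name, seq, first, last in PEPTIDES:
    print(name, "extra cysteine:", coupling_cysteine(seq, first, last))

human = "KQQIARGFEALEALEEALK"   # human PARP3, amino acids 230-248
mouse = "KQQIARGFEALEALEEAMK"   # corresponding mouse sequence
print("differing positions within the peptide:", mismatches(human, mouse))
```

Under these assumptions all four peptides are consistent with their stated ranges, and the human and mouse PARP3 peptides differ at a single position.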

Rabbits were immunized a total of five times, at intervals of
7-14 days, with the KLH-peptide conjugate. The antiserum obtained
was affinity-purified using the antigen. The specific IgG
fraction was isolated from the serum using the respective peptides
which, for this purpose, were initially immobilized on an
affinity column in the manner familiar to the skilled worker. The
respective antiserum was loaded onto this affinity column, and
nonspecifically sorbed proteins were eluted with buffer. The
specifically bound IgG fraction was eluted with 0.2 M glycine/HCl
buffer pH 2.2. The pH was immediately increased using a 1 M
TRIS/HCl buffer pH 7.5. The eluate containing the IgG fraction was
mixed 1:1 (volume) with saturated ammonium sulfate solution and
incubated at +4°C for 30 min to complete the precipitation. The
resulting precipitate was centrifuged at 10,000 g and, after
removal of the supernatant, dissolved in the minimum amount of
PBS/TBS. The resulting solution was then dialyzed against PBS/TBS
in the ratio 1:100 (volume). The antibodies were adjusted to a
concentration of about 100 µg of IgG/ml. The PARP3 antibodies
purified in this way had high specificity for PARP3. Whereas mouse
PARP3 was recognized well, there was no observable cross-reaction
with PARP1 or PARP2.
Example 4: Analysis of the tissue distribution by immunoblot
(Western blot)

The tissue distribution at the protein level was also
investigated for PARP2 and PARP3 by immunoblot (Western blot)
analysis.

Preparation of the mouse tissues for protein gels:

Tissues or cells were homogenized using a Potter or Ultra-Turrax.
For this, 0.5 g of tissue (or cells) was incubated in 5 ml of
buffer (10 mM Tris-HCl pH 7.5, 1 mM EDTA, 6 mM MgCl2), one tablet
of protease inhibitor cocktail (Boehringer Mannheim, order No.:
1836153) and benzonase (purity grade I, MERCK) at 37°C for 30 min.
Tissue samples from mice were produced for heart, lung, liver,
spleen, kidney, intestine, muscle and brain, and samples were
also produced for human embryonic kidney cells (HEK293).

Protein gels:

The NuPAGE system supplied by NOVEX was used according to the
instructions for protein gels. Polyacrylamide gels (NuPAGE 4-12%
BisTris, NOVEX NP 0321), running buffer (MES-Running Buffer,
NOVEX NP 0002), antioxidant (NOVEX NP 0005), protein size
standard (Multi Mark Multi Colored Standard, NOVEX LC 5725) and
sample buffer (NuPAGE LDS Sample Buffer (4X), NOVEX NP 0007) were
used. The gels were run for 45 minutes at a voltage of 200 V.

Western blot:

Western blots were carried out using the NOVEX system in
accordance with instructions. A nitrocellulose membrane
(Nitrocellulose Pore size 0.45 µm, NOVEX LC 2001) was used. The
transfer took 1 hour at a current of 200 mA. The transfer buffer
consisted of 50 ml of transfer buffer concentrate (NOVEX NP 0006),
1 ml of antioxidant (NOVEX NP 0002), 100 ml of analytical grade
methanol and 849 ml of double-distilled water.
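The four volumes of the transfer buffer add up to one liter, which makes scaling to other batch sizes straightforward. The following sketch shows the arithmetic; the 500 ml batch and the helper name are assumptions made for illustration.

```python
# Volumes (ml) of the transfer buffer components as stated in the text.
TRANSFER_BUFFER_ML = {
    "transfer buffer concentrate": 50.0,
    "antioxidant": 1.0,
    "methanol": 100.0,
    "double-distilled water": 849.0,
}

def scale_recipe(recipe_ml, final_volume_ml):
    """Scale every component so that the volumes add up to final_volume_ml."""
    total = sum(recipe_ml.values())
    return {name: vol * final_volume_ml / total for name, vol in recipe_ml.items()}

total_ml = sum(TRANSFER_BUFFER_ML.values())
methanol_percent = 100.0 * TRANSFER_BUFFER_ML["methanol"] / total_ml
half_batch = scale_recipe(TRANSFER_BUFFER_ML, 500.0)  # assumed batch size

print(f"total volume: {total_ml:.0f} ml, methanol: {methanol_percent:.0f}% (v/v)")
for name, vol in half_batch.items():
    print(f"{name}: {vol:.1f} ml")
```

The stated volumes correspond to a 20-fold dilution of the concentrate and 10% (v/v) methanol.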

Besides the blots produced in this way, premade blots were also
used, for example from Chemicon (mouse brain blot, Chemicon,
catalog No.: NS 106 with the tissues 1. frontal cortex, 2.
posterior cortex, 3. cerebellum, 4. hippocampus, 5. olfactory bulb,
6. striatum, 7. thalamus, 8. mid brain, 9. entorhinal cortex, 10.
pons, 11. medulla, 12. spinal cord).
Antibody reaction with PARP3:

The Western blots were blocked in TBST (TBS + 0.3% Tween 20)
with 5% dry milk powder for at least 2 hours (TBS: 100 mM Tris pH
7.5, 200 mM NaCl). The reaction with the primary antibody
(dilution 1:1000) took place in TBST with 5% dry milk powder
(see above) at room temperature for at least 2 hours or at 4°C
overnight, with gentle agitation (vertical rotator). This was
followed by washing three times in TBST for 5 minutes. Incubation
with the secondary antibody (anti-rabbit IgG, peroxidase-coupled,
SIGMA A-6154, dilution 1:2000) took place in TBST with 5% dry
milk powder for 1 hour. This was followed by washing three times
for 5 minutes each time as above. The subsequent detection was
based on chemiluminescence using the SUPER BLAZE kit (Pierce,
Signal BLAZE Chemiluminescent Substrate 34095) as stated by the
manufacturer. "Lumi-Film" (Chemiluminescent Detection Film,
Boehringer order No: 1666916) was used. The films were
developed for about 2 min (X-ray developer concentrate,
ADEFO-Chemie GmbH), hydrated, fixed for about 4 min (Acidofix
85 g/l / AGFA), hydrated and then dried.

Example 5: Preparation of the enzymes

For comparison, human PARP1 was expressed recombinantly in the
baculovirus system in the manner familiar to the skilled worker
and partially purified as described (Shah et al., Analytical
Biochemistry 1995, 227, 1-13). Bovine PARP1 in a purity of 30-50%
(c = 0.22 mg/ml, spec. activity 170 nmol of ADP-ribose/min/mg of
total protein at 25°C) was purchased from BIOMOL (order No.
SE-165). Human and mouse PARP2 and PARP3 were expressed
recombinantly in the baculovirus system (Bac-to-Bac system, BRL
LifeScience). For this purpose, the appropriate cDNAs were cloned
into the pFASTBAC-1 vector. Preparation of recombinant baculovirus
DNA by recombination in E. coli was followed by transfection of
insect cells (Sf9 or High-Five) with the appropriate recombinant
baculovirus DNAs. Expression of the corresponding proteins was
verified by Western blot analysis. Virus strains were amplified in
the manner familiar to the skilled worker. Larger amounts of
recombinant proteins were produced by infecting 500 ml of
insect cell culture (2 x 10^6 cells/ml) with viruses at an MOI
(multiplicity of infection; ratio of viruses to cells) of 5-10
and incubating for 3 to 4 days. The insect cells were then
pelleted by centrifugation, and the proteins were purified from
the pellet.
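The amount of virus needed for such an infection follows directly from the definition of the MOI. A minimal sketch of the calculation is given below; the virus titer of 1 x 10^8 pfu/ml is a hypothetical value chosen for illustration and is not stated in the text.

```python
def virus_stock_volume_ml(culture_ml, cells_per_ml, moi, titer_pfu_per_ml):
    """Volume of virus stock needed to reach the requested MOI."""
    total_cells = culture_ml * cells_per_ml
    required_pfu = total_cells * moi
    return required_pfu / titer_pfu_per_ml

CULTURE_ML = 500       # from the text
CELLS_PER_ML = 2e6     # from the text
ASSUMED_TITER = 1e8    # pfu/ml; hypothetical, depends on the actual stock

for moi in (5, 10):
    volume = virus_stock_volume_ml(CULTURE_ML, CELLS_PER_ML, moi, ASSUMED_TITER)
    print(f"MOI {moi}: {CULTURE_ML * CELLS_PER_ML * moi:.1e} pfu, "
          f"about {volume:.0f} ml of stock at the assumed titer")
```

With 1 x 10^9 cells in the culture, an MOI of 5-10 corresponds to 5 x 10^9 to 1 x 10^10 infectious particles.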

The purification took place by classical methods of protein
purification familiar to the skilled worker, detecting the enzymes
with appropriate specific antibodies. In some cases, the proteins
were also affinity-purified on a 3-aminobenzamide affinity column
as described (Burtscher et al., Anal Biochem 1986, 152:285-290).
The purity was >90%.

Example 6: Assay systems for determining the activity
of PARP2 and PARP3 and the inhibitory action of effectors on
PARP1, PARP2 and PARP3.

a) Production of antibodies against poly(ADP-ribose)

It is possible to use poly(ADP-ribose) as antigen for generating
anti-poly(ADP-ribose) antibodies. The production of
anti-poly(ADP-ribose) antibodies is described in the literature
(Kanai Y et al. (1974) Biochem Biophys Res Comm 59:1, 300-306;
Kawamitsu H et al. (1984) Biochemistry 23, 3771-3777; Kanai Y
et al. (1978) Immunology 34, 501-508).

The following were used, inter alia: anti-poly(ADP-ribose)
antibodies (polyclonal antiserum, rabbits), BIOMOL, order No.
SA-276; anti-poly(ADP-ribose) antibodies (monoclonal, mouse;
clone 10H; hybridoma supernatant, affinity-purified).

The antisera or monoclonal antibodies obtained from hybridoma
supernatant were purified by protein A affinity chromatography in
the manner familiar to the skilled worker.

b) ELISA assay

Materials:

ELISA color reagent: TMB mix, SIGMA T-8540

A 96-well microtiter plate (FALCON Micro-Test III Flexible Assay
Plate, # 3912) was coated with histones (SIGMA, H-7755). Histones
were for this purpose dissolved in carbonate buffer (0.05 M
Na2HCO3; pH 9.4) in a concentration of 50 µg/ml. The individual
wells of the microtiter plate were each incubated with 150 µl of
this histone solution at room temperature for at least 2 hours or
at 4°C overnight. The wells were then blocked by adding 150 µl of
a 1% strength BSA solution (SIGMA, A-7888) in carbonate buffer at
room temperature for 2 hours. This was followed by three washing
steps with washing buffer (0.05% Tween 20 in 1x PBS; PBS
(phosphate buffered saline; Gibco, order No. 10010): 0.21 g/l
KH2PO4, 9 g/l NaCl, 0.726 g/l Na2HPO4 · 7H2O, pH 7.4). Washing
steps were all carried out in a microtiter plate washer
("Columbus" microtiter plate washer, SLT-Labinstruments, Austria).

Required for the enzyme reaction were an enzyme reaction solution
and a substrate solution, in each case as a premix. The absolute
amount of these solutions depended on the intended number of
assay wells.

Composition of the enzyme reaction solution per well:
- 4 µl of PARP reaction buffer (1 M Tris-HCl pH 8.0, 100 mM MgCl2,
  10 mM DTT)
- 20 ng of PARP1 (human or bovine) or 8 ng of PARP2 (human or mouse)
- 4 µl of activated DNA (1 mg/ml; SIGMA, D-4522)
- H2O ad 40 µl

Composition of the substrate solution per well:
- 5 µl of PARP reaction buffer (10x)
- 0.8 µl of NAD solution (10 mM, SIGMA N-1511)
- 44 µl of H2O

Inhibitors were dissolved in 1x PARP reaction buffer. DMSO, which
was occasionally used to dissolve inhibitors in higher
concentrations, caused no problems up to a final concentration of
2%. For the enzyme reaction, 40 µl of the enzyme reaction solution
were introduced into each well and incubated with 10 µl of
inhibitor solution for 10 minutes. The enzyme reaction was then
started by adding 50 µl of substrate solution per well. The
reaction was carried out at room temperature for 30 minutes and
then stopped by washing three times with washing buffer.
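The per-well compositions can be scaled to a premix for any number of wells, and the final concentrations in the 100 µl reaction follow from simple dilution. The sketch below illustrates this. The 10% pipetting overage and the helper names are assumptions; the listed substrate components add up to 49.8 µl, which the sketch treats as the nominal 50 µl stated in the text.

```python
# Per-well volumes in µl taken from the two compositions above.
ENZYME_MIX_UL = {"10x PARP reaction buffer": 4.0, "activated DNA (1 mg/ml)": 4.0}
ENZYME_MIX_TOTAL_UL = 40.0   # made up to volume with enzyme and water
SUBSTRATE_MIX_UL = {"10x PARP reaction buffer": 5.0, "NAD (10 mM)": 0.8, "H2O": 44.0}
INHIBITOR_UL = 10.0
SUBSTRATE_ADDED_UL = 50.0    # nominal volume added per well

def premix_volumes(per_well_ul, wells, overage=0.10):
    """Scale per-well volumes to a premix for `wells` wells plus an assumed overage."""
    factor = wells * (1.0 + overage)
    return {name: vol * factor for name, vol in per_well_ul.items()}

def final_concentration(stock_conc, added_ul, final_ul):
    """Concentration after diluting `added_ul` of stock into `final_ul` total."""
    return stock_conc * added_ul / final_ul

substrate_total_ul = sum(SUBSTRATE_MIX_UL.values())
reaction_ul = ENZYME_MIX_TOTAL_UL + INHIBITOR_UL + SUBSTRATE_ADDED_UL
nad_final_uM = final_concentration(10_000.0, 0.8, reaction_ul)      # 10 mM stock in µM
dna_final_ug_per_ml = final_concentration(1000.0, 4.0, reaction_ul)

print(f"reaction volume per well: {reaction_ul:.0f} µl")
print(f"final NAD: {nad_final_uM:.0f} µM, final activated DNA: {dna_final_ug_per_ml:.0f} µg/ml")
for name, vol in premix_volumes(SUBSTRATE_MIX_UL, wells=96).items():
    print(f"substrate premix, {name}: {vol:.1f} µl")
```

In both premixes the 10x buffer is diluted tenfold, so the reaction runs in 1x PARP reaction buffer throughout.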

The primary antibodies employed were specific
anti-poly(ADP-ribose) antibodies in a dilution of 1:5000. Dilution
took place in antibody buffer (1% BSA in PBS; 0.05% Tween 20). The
incubation time for the primary antibodies was one hour at room
temperature. After subsequently washing three times with washing
buffer, incubation was carried out with the secondary antibody
(anti-mouse IgG, Fab fragments, peroxidase-coupled, Boehringer
Mannheim, order No. 1500.686; anti-rabbit IgG, peroxidase-coupled,
SIGMA, order No. A-6154) in a dilution of 1:10,000 in antibody
buffer at room temperature for one hour. Washing three times with
washing buffer was followed by the color reaction using 100 µl of
color reagent (TMB mix, SIGMA) per well at room temperature for
about 15 min. The color reaction was stopped by adding 100 µl of
2 M H2SO4. This was followed by immediate measurement in an ELISA
plate reader (EAR340AT "Easy Reader", SLT-Labinstruments,
Austria) (450 nm versus 620 nm). The measurement principle is
depicted diagrammatically in Figure 6.
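The readout at 450 nm against a 620 nm reference can be turned into a residual activity per well. The following sketch shows one plausible way of doing so. The controls (a blank without enzyme, an uninhibited well) and all readings are hypothetical and are not specified in the text.

```python
def corrected_absorbance(a450, a620):
    """Signal absorbance minus the reference-wavelength reading of the same well."""
    return a450 - a620

def percent_activity(sample, blank, uninhibited):
    """Residual PARP activity in percent of the uninhibited control.

    Assumption: `blank` is a well without enzyme and `uninhibited` a well
    without inhibitor; neither control is spelled out in the text.
    """
    if uninhibited == blank:
        raise ValueError("uninhibited control equals blank; no signal window")
    return 100.0 * (sample - blank) / (uninhibited - blank)

# Hypothetical plate readings, for illustration only.
blank = corrected_absorbance(0.080, 0.040)
control = corrected_absorbance(1.290, 0.050)
sample = corrected_absorbance(0.690, 0.050)
activity = percent_activity(sample, blank, control)
print(f"{activity:.1f} % of control activity")
```

Values of this kind, obtained over a range of inhibitor concentrations, form the input of a dose-effect plot.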

Various concentrations were used to construct a dose-effect plot
to determine the Ki value of an inhibitor. Values were obtained in
triplicate for each inhibitor concentration. Arithmetic
means were determined using Microsoft® Excel. The IC50
was determined using the Microcal® Origin software (Vers. 5.0)
("Sigmoidal Fit"). The IC50 values calculated in this way were
converted into Ki values by using "calibration inhibitors". The
"calibration inhibitors" were also measured in each
analysis. The Ki values of the "calibration inhibitors" were
determined in the same assay system by analysis of the Dixon
diagram in the manner familiar to the skilled worker.
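The evaluation chain (triplicate means, sigmoidal fit, conversion via a calibration inhibitor) can be sketched as follows. This is not the Origin routine used in the text but a simple least-squares grid search on a Hill-type curve. The proportional IC50-to-Ki conversion is an assumption, since the text does not give the formula, and all data and calibration values are hypothetical.

```python
import math
from statistics import mean

def residual_activity(conc, ic50, slope):
    """Sigmoidal dose-effect curve: 1.0 without inhibitor, 0.0 at saturation."""
    return 1.0 / (1.0 + (conc / ic50) ** slope)

def fit_ic50(concs, activities, slopes=(0.5, 0.75, 1.0, 1.25, 1.5, 2.0), steps=400):
    """Least-squares grid search over log10(IC50) and a small set of Hill slopes."""
    lo = math.log10(min(concs)) - 1.0
    hi = math.log10(max(concs)) + 1.0
    best = None
    for i in range(steps + 1):
        ic50 = 10.0 ** (lo + (hi - lo) * i / steps)
        for slope in slopes:
            sse = sum((residual_activity(c, ic50, slope) - a) ** 2
                      for c, a in zip(concs, activities))
            if best is None or sse < best[0]:
                best = (sse, ic50, slope)
    return best[1], best[2]

def ki_from_calibration(ic50_test, ic50_calibration, ki_calibration):
    """Assumed conversion: Ki scales with IC50 as observed for the calibration inhibitor."""
    return ic50_test * ki_calibration / ic50_calibration

# Hypothetical demo data: triplicates scattered around a curve with IC50 = 50 nM.
TRUE_IC50_NM = 50.0
concs_nM = [1, 3, 10, 30, 100, 300, 1000]
triplicates = [
    tuple(residual_activity(c, TRUE_IC50_NM, 1.0) + noise for noise in (-0.01, 0.0, 0.01))
    for c in concs_nM
]
means = [mean(t) for t in triplicates]
ic50_nM, slope = fit_ic50(concs_nM, means)
ki_nM = ki_from_calibration(ic50_nM, ic50_calibration=120.0, ki_calibration=30.0)
print(f"IC50 about {ic50_nM:.1f} nM (slope {slope}), Ki about {ki_nM:.1f} nM")
```

A proportional conversion of this kind is only meaningful if the test and calibration inhibitors are measured under identical substrate conditions, which is why the calibration inhibitors are included in every analysis.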

b) HTRF (homogeneous time-resolved fluorescence) assay

In the HTRF PARP assay according to the invention, histones,
as target proteins for modification by PARP, are labeled
indirectly with an XL665 fluorophore. The antibody is directly
labeled with a europium cryptate. If the XL665 fluorophore is in
the direct spatial vicinity of the cryptate, which is ensured by
binding of the antibody to the poly(ADP-ribose) on the histone,
then energy transfer is possible. The emission at 665 nm is thus
directly proportional to the amount of bound antibody, which in
turn is equivalent to the amount of poly(ADP-ribose). The measured
signal thus corresponds to the PARP activity. The measurement
principle is depicted diagrammatically in Figure 7. The materials
used are identical to those used in the ELISA assay (see above)
unless expressly indicated.

Histones were dissolved in a concentration of 3 mg/ml in Hepes
buffer (50 mM, pH = 7.5). Biotinylation took place with
sulfo-NHS-LC-biotin (Pierce, #21335T). A molar ratio of 4 biotin
per histone was used. The incubation time was 90 minutes (RT).
The biotinylated histones were then purified on a G25 SF HR10/10
column (Pharmacia, 17-0591-01) in Hepes buffer (50 mM, pH ~ 7.0) in
order to remove excess biotinylation reagent. The
anti-poly(ADP-ribose) antibody was labeled with europium cryptate
using bifunctional coupling reagents (Lopez, E. et al., Clin.
Chem. 39(2), 196-201 (1993); US Patent 5,534,622). Purification
took place on a G25SF HR10/30 column. A molar ratio of 3.1
cryptates per antibody was achieved. The yield was 25%. The
conjugates were stored at -80°C in the presence of 0.1% BSA in
phosphate buffer (0.1 M, pH = 7).

For the enzyme reaction, the following were pipetted into each
well:
- 10 µl of PARP solution in PARP HTRF reaction buffer (50 mM
  Tris-HCl pH 8.0, 10 mM MgCl2, 1 mM DTT) with 20 ng of PARP1
  (human or bovine) or 8 ng of PARP2 (human or mouse)
- 10 µl of activated DNA in PARP HTRF reaction buffer (50 µg/ml)
- 10 µl of biotinylated histones in PARP HTRF reaction buffer
  (1.25 mM)
- 10 µl of inhibitor in PARP HTRF reaction buffer

These reagents were incubated for 2 minutes before the reaction
was started by adding
- 10 µl of NAD solution in PARP HTRF reaction buffer (41 µM/ml
  [sic]).

The reaction time was 30 minutes at room temperature.

The reaction was then stopped by adding
- 10 µl of PARP inhibitor (25 µM, Ki = 10 nM) in "Revelation"
  buffer (100 mM Tris-HCl pH 7.2, 0.2 M KF, 0.05% BSA).

The following were then added:
- 10 µl of EDTA solution (SIGMA, E-7889, 0.5 M in H2O)
- 100 µl of Sa-XL665 (Packard Instruments) in "Revelation" buffer
  (15-31.25 nM)
- 50 µl of anti-PARP cryptate in "Revelation" buffer (1.6-3.3 nM).

Measurement was then possible after 30 minutes (up to 4 hours).
The measurement took place in a "Discovery HTRF microplate
analyzer" (Packard Instruments). The Ki values were calculated as
described for the ELISA assay.
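The pipetting scheme of the HTRF assay can be tallied to obtain the reaction volume, the volume at the time of measurement and the resulting dilutions. The sketch below uses only the volumes given in the text; the helper names are illustrative.

```python
# Additions per well in µl, in the order given in the text.
REACTION_ADDITIONS_UL = [
    ("PARP solution", 10.0),
    ("activated DNA", 10.0),
    ("biotinylated histones", 10.0),
    ("inhibitor", 10.0),
    ("NAD solution", 10.0),
]
DETECTION_ADDITIONS_UL = [
    ("stop inhibitor", 10.0),
    ("EDTA solution", 10.0),
    ("Sa-XL665", 100.0),
    ("cryptate-labeled antibody", 50.0),
]

def total_volume(additions):
    """Sum of all pipetted volumes in µl."""
    return sum(volume for _, volume in additions)

def diluted(stock_conc, added_ul, total_ul):
    """Concentration of a reagent after it is diluted into total_ul."""
    return stock_conc * added_ul / total_ul

reaction_ul = total_volume(REACTION_ADDITIONS_UL)
final_ul = reaction_ul + total_volume(DETECTION_ADDITIONS_UL)

nad_dilution = reaction_ul / 10.0                        # fold dilution of the NAD solution
stop_inhibitor_uM = diluted(25.0, 10.0, reaction_ul + 10.0)
edta_final_mM = diluted(500.0, 10.0, final_ul)           # 0.5 M stock expressed in mM

print(f"reaction volume: {reaction_ul:.0f} µl, volume at measurement: {final_ul:.0f} µl")
print(f"NAD solution is diluted {nad_dilution:.0f}-fold in the reaction")
print(f"stop inhibitor after addition: {stop_inhibitor_uM:.2f} µM")
print(f"EDTA at measurement: {edta_final_mM:.1f} mM")
```

Each reaction component is thus diluted fivefold in the 50 µl reaction, which should be taken into account when stating final inhibitor concentrations.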

Example 7: Test systems for determining the therapeutic efficacy
of PARP inhibitors

Novel PARP inhibitors can have their therapeutic efficacy checked
in relevant pharmacological models. Examples of some suitable
models are listed in Table 1.



Table 1

Disorder:   Neurodegenerative disorders (stroke, Parkinson's, etc.)
Model:      NMDA excitotoxicity in mice or rats
Literature: See below for description

Disorder:   Stroke
Model:      Permanent MCAO ("middle cerebral arterial occlusion")
Literature: Tokime, T. et al., J. Cereb. Blood Flow Metab., 18(9):
            991-7, 1998.
            Guegan, C, Brain Research. Molecular Brain Research,
            55(1): 133-40, 1998.

Disorder:   Stroke
Model:      Transient, focal MCAO in rats or mice
Literature: Eliasson MJL et al., Nat Med 1997, 3:1089-1095.
            Endres, M et al., J Cereb Blood Flow Metab 1997,
            17:1143-1151.
            Takahashi K et al., J Cereb Blood Flow Metab 1997,
            17:1137-1142.

Disorder:   Parkinson's disease
Model:      MPTP (1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine)
            toxicity in mice/rats
Literature: Cosi C, et al., Brain Res., 1998, 809(1):58-67.
            Cosi C, et al., Brain Res., 1996, 729(2):264-9.

Disorder:   Myocardial infarct
Model:      Coronary vessel occlusion in rats, pigs or rabbits
Literature: Richard V, et al., Br. J. Pharmacol 1994, 113, 869-876.
            Thiemermann C, et al., Proc Natl Acad Sci USA. 1997,
            94(2):679-83.
            Zingarelli B, et al., Cardiovasc Res. 1997,
            36(2):205-15.

Disorder:   Myocardial infarct
Model:      Langendorf heart model in rats or rabbits
Literature: See below for description

Disorder:   Septic shock
Model:      Endotoxin shock in rats
Literature: Szabo C, et al., J Clin Invest, 1997, 100(3):723-35.

Disorder:   Septic shock
Model:      Zymosan- or carrageenan-induced multiple organ failure
            in rats or mice
Literature: Szabo C, et al., J Exp Med. 1997, 186(7):1041-9.
            Cuzzocrea S, et al., Eur J Pharmacol. 1998,
            342(1):67-76.

Disorder:   Rheumatoid arthritis
Model:      Adjuvant- or collagen-induced arthritis in rats or mice
Literature: Szabo C, et al., Proc Natl Acad Sci USA. 1998,
            95(7):3867-72.

Disorder:   Diabetes
Model:      Streptozotocin- and alloxan-induced or
            obesity-associated
Literature: Uchgata Y et al., Diabetes 1983, 32: 316-318.
            Masiello P et al., Diabetologia 1985, 28: 683-686.
            Shimabukuro M et al., J Clin Invest 1997, 100:
            290-295.

Disorder:   Cancer
Model:      In vitro model; see below
Literature: Schlicker et al., 1999, 75(1), 91-100.



a) NMDA excitotoxicity model

Glutamate is the most important excitatory neurotransmitter
in the brain. Under normal conditions, glutamate is secreted into
the synaptic cleft and stimulates the post-synaptic glutamate
receptors, specifically the glutamate receptors of the "NMDA" and
"AMPA" types. This stimulation plays a significant part in
numerous functions of the brain, including learning, memory and
motor control.

Under the conditions of acute and chronic neurodegeneration (e.g.
stroke), however, there is a great increase in the presynaptic
glutamate secretion, resulting in excessive stimulation of the
receptors. This leads to death of the cells stimulated in this
way. These increased glutamate activities occur in a number of
neurological disorders or psychological disturbances and lead to
states of overexcitation or toxic effects in the central nervous
system (CNS) but also in the peripheral nervous system. Thus,
glutamate is involved in a large number of neurodegenerative
disorders, in particular neurotoxic disturbances following hypoxia,
anoxia, ischemia and after lesions like those occurring after
stroke and trauma, and in stroke, Alzheimer's disease,
Huntington's disease, amyotrophic lateral sclerosis (ALS; "Lou
Gehrig's disease"), cranial trauma, spinal cord trauma, peripheral
neuropathies, AIDS dementia and Parkinson's disease. Another
disease in which glutamate receptors are important is epilepsy
(cf. Brain Res Bull 1998; 46(4):281-309, Eur Neuropsychopharmacol
1998, 8(2):141-52).

Glutamate effects are mediated through various receptors. One of
these receptors is called the NMDA (N-methyl-D-aspartate)
receptor after a specific agonist (Arzneim. Forschung 1990, 40,
511-514; TIPS, 1990, 11, 334-338; Drugs of the Future 1989, 14,
1059-1071). N-Methyl-D-aspartate is a strong agonist of a
particular class of glutamate receptors ("NMDA" type). Stimulation
of the NMDA receptor leads to influx of calcium into the cell and
the generation of free radicals. The free radicals lead to DNA
damage and activation of PARP. PARP in turn causes cell death
through depletion of high-energy phosphates (NAD and ATP) in the
cell. This explains the toxicity of NMDA. Treatment of animals
with NMDA can therefore be regarded as a model of the
abovementioned disorders in which excitotoxicity is involved.

Because of the importance of glutamate receptors in
neurodegeneration, many pharmacological approaches to date have
been directed at specific blocking of precisely these receptors.
However, because of their importance in normal stimulus
conduction, these approaches have proved to be problematic (side
effects). In addition, stimulation of the receptors is an event
which takes place very rapidly so that administration of the
receptor blockers often comes too late ("time window" problem).
Thus there is a great need for novel principles of action and
inhibitors of NMDA-related neurotoxicity.

Protection against cerebral overexcitation by excitatory amino
acids (NMDA antagonism in mice) can be regarded as adequate proof
of the activity of a pharmacological effector of PARP in
disorders based on excitotoxicity. Intracerebral administration of
excitatory amino acids (EAA) induces such massive overexcitation
that it leads within a short time to convulsions and death of the
animals (mice).

In the present case there was unilateral intracerebroventricular
administration of 10 µl of a 0.035% strength aqueous NMDA solution
120 minutes after intraperitoneal (i.p.) administration of the
test substance. These symptoms can be inhibited by systemic, e.g.
intraperitoneal, administration of centrally acting drugs. Since
excessive activation of EAA receptors in the central nervous
system plays an important part in the pathogenesis of various
neurological disorders, information can be gained from the
detected EAA antagonism in vivo about possible therapeutic
utilizability of the substances for such CNS disorders. As a
measure of the activity of the substances, an ED50 was determined,
i.e. the dose at which 50% of the animals are free of symptoms
after a fixed dose of NMDA owing to the preceding i.p.
administration of the test substance.
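The text does not state how the ED50 was computed from the proportion of symptom-free animals. One common and simple possibility is linear interpolation on a logarithmic dose scale, sketched below together with the NMDA mass per injection. The percentage is read as weight/volume, and all dose-response values are hypothetical and serve only to exercise the function.

```python
import math

def nmda_dose_ug(volume_ul, percent_w_v):
    """Injected mass in µg, assuming the percentage is weight/volume (g per 100 ml)."""
    ug_per_ul = percent_w_v * 10.0   # 1% (w/v) = 10 mg/ml = 10 µg/µl
    return ug_per_ul * volume_ul

def ed50_log_interpolation(doses_mg_per_kg, protected_fraction):
    """Dose giving 50% protection by linear interpolation on log10(dose)."""
    points = sorted(zip(doses_mg_per_kg, protected_fraction))
    for (d_lo, p_lo), (d_hi, p_hi) in zip(points, points[1:]):
        if p_lo <= 0.5 <= p_hi and p_lo != p_hi:
            t = (0.5 - p_lo) / (p_hi - p_lo)
            log_ed50 = math.log10(d_lo) + t * (math.log10(d_hi) - math.log10(d_lo))
            return 10.0 ** log_ed50
    raise ValueError("50% protection is not bracketed by the tested doses")

print(f"NMDA per injection: {nmda_dose_ug(10.0, 0.035):.1f} µg")

# Hypothetical dose groups (mg/kg i.p.) and fractions of symptom-free animals.
demo_doses = [3.0, 10.0, 30.0, 100.0]
demo_protected = [0.0, 0.2, 0.6, 1.0]
print(f"interpolated ED50: {ed50_log_interpolation(demo_doses, demo_protected):.1f} mg/kg")
```

Probit or logit regression would be an alternative where more dose groups are available; the interpolation shown here needs only the two doses bracketing 50% protection.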

The specific PARP2 inhibitor 2-(4-(2-(N,N-diethylamino)-
eth-1-yloxy)phenyl)benzimidazole-4-carboxamide surprisingly
shows an activity with an ED50 of about 30 mg/kg in this test.



M/40043 



BASF Aktiengesellschaft 990154 O.Z. 0050/49790

b) Langendorf heart model (model for myocardial infarct)

Male Sprague-Dawley rats (bodyweight 300-400 g; origin Janvier,
Le Genest-St-Isle, France) were used for the test. The rats were
treated orally by gavage with the active substance or placebo
(volume: 5 ml/kg). 50 minutes later, heparin was administered
intraperitoneally (Liquemin N Roche, 125 IU/animal in 0.5 ml).
The animals were anesthetized with Inactin® T133 (thiobutabarbital
sodium 10%), fixed on the operating table, tracheotomized and
ventilated with a "Harvard ventilatory pump" (40 beats/min,
4.5 ml/beat). Thoracotomy was followed by immediate
catheterization of the aorta, removal of the heart and immediate
retrograde perfusion. The hearts were perfused with a constant
pressure of 75 mmHg, which was achieved using a "Gilson Miniplus 2
perfusion pump". Composition of the perfusate (mmol/l): NaCl 118,
KCl 4.7, CaCl2 x 2 H2O 2.52, MgSO4 x 7 H2O 1.64, NaHCO3 24.88,
KH2PO4 1.18, glucose 11. The temperature was kept at 37°C
throughout the experiment. Functional parameters were continuously
recorded using a "Gould 4-channel recorder". Measurements were
made of the left-ventricular pressure (LVP; mmHg), LVEDP (mmHg),
enzyme release (creatine kinase, mU/ml/g), coronary flow rate
(ml/min) and HR (pulse rate, min^-1). The left-ventricular
pressure was measured using a liquid-filled latex balloon and a
Statham 23 Db pressure transducer. The volume of the balloon was
initially adjusted to reach an LVEDP (left-ventricular
end-diastolic pressure) of about 12 mmHg. dP/dt max (maximum
pumping force) was derived from the pressure signal using a
differentiator module. The heart rate was calculated from the
pressure signal. The flow rate was determined using a drop counter
(BMT Messtechnik GmbH Berlin).
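For preparing the perfusate, the molar composition has to be converted into weighed amounts. The sketch below performs this conversion using standard molar masses; glucose is taken as anhydrous D-glucose and the batch volume is an assumed parameter, neither of which is specified in the text.

```python
# Perfusate composition in mmol/l as given in the text.
PERFUSATE_MMOL_PER_L = {
    "NaCl": 118.0,
    "KCl": 4.7,
    "CaCl2 x 2 H2O": 2.52,
    "MgSO4 x 7 H2O": 1.64,
    "NaHCO3": 24.88,
    "KH2PO4": 1.18,
    "glucose": 11.0,
}

# Molar masses in g/mol (standard values; glucose assumed anhydrous).
MOLAR_MASS_G_PER_MOL = {
    "NaCl": 58.44,
    "KCl": 74.55,
    "CaCl2 x 2 H2O": 147.01,
    "MgSO4 x 7 H2O": 246.47,
    "NaHCO3": 84.01,
    "KH2PO4": 136.09,
    "glucose": 180.16,
}

def grams_per_batch(volume_l):
    """Mass of each component in grams for a batch of `volume_l` liters."""
    return {
        name: mmol / 1000.0 * MOLAR_MASS_G_PER_MOL[name] * volume_l
        for name, mmol in PERFUSATE_MMOL_PER_L.items()
    }

for name, grams in grams_per_batch(1.0).items():
    print(f"{name}: {grams:.3f} g per liter")
```

If the hydrate forms actually used differ from those named in the composition, the molar masses have to be adapted accordingly.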

After an equilibration time of 20 minutes, the hearts were
subjected to a 30-minute global ischemia by stopping the perfusate
supply while keeping the temperature at 37°C. During the following
60-minute reperfusion period, samples of the perfusate were taken
after 3, 5, 10, 15, 30, 45 and 60 min for analysis of creatine
kinase (CK) activity. Means and standard deviations for the
measured parameters were analyzed statistically (Dunnett test).
The significance limit was p=0.05.

The experiment on rabbit hearts was carried out similarly. Male
white New Zealand rabbits (obtained from: Interfauna) were used.
The hearts were prepared as described above for the rat model.
The perfusion pressure was set at a maximum of 60 mmHg and the
flow rate at about 25 ml/min. The equilibration time was about
30 min. The substance was administered by infusion directly
upstream of the heart. 15 min after starting the infusion, a
30-minute global ischemia was caused by stopping the flow while
maintaining the temperature of the heart. A 30-minute reperfusion
followed. Perfusate was taken for investigation of CK activity
before administration of the substance, after 15 min and at
various times (5, 10, 15, 20, 30 min) during the reperfusion. The
following parameters were measured: LVP (mmHg), LVEDP, LVdP/dt,
PP (mmHg), HR (pulse rate; beats/min), CK activity (U/min/g heart
weight).

c) Animal model for acute kidney failure 

The protective effect of intravenous administration of PARP
inhibitors (4 days) on the kidney function of rats with 
postischemic acute kidney failure was investigated. 

Male Sprague-Dawley rats (about 330 g at the start of the
experiments; breeder: Charles River) were used. 10-15 animals were
employed per experimental group. Administration of active
substance/placebo took place continuously with an osmotic micropump
into the femoral vein. Orbital blood was taken (1.5 ml of whole
blood) under inhalation anesthesia with enflurane (Ethrane, Abbott,
Wiesbaden).

After the initial measurements (blood sample) and determination
of the amount of urine excreted in 24 h, the rats were
anesthetized ("Nembutal", pentobarbital sodium, Sanofi CEVA;
50 mg/kg i.p., volume injected 1.0 ml/kg) and fastened on a
heatable operating table (37°C). 125 IU/kg heparin (Liquemin N,
Roche) were administered i.v. into the caudal vein. The abdominal
cavity was opened and the right kidney was exposed. The
branching-off renal artery was exposed and clamped off superiorly
using bulldog clamps (Diefenbach 38 mm). The left renal artery was
likewise exposed and clamped off (superiorly, about half way to
the kidney). During the operation, an osmotic micropump was
implanted into the femoral vein. The intestine was reinserted and
the fluid loss was compensated with lukewarm 0.9% NaCl. The
animals were covered with a moist cloth and kept warm under red
light. After 40 min, the appearance of the kidneys was recorded,
and the clamps were removed, first the right then the left. The
intestine was put back and 2 drops of antibiotic (Tardomyocel,
Bayer) were added. The abdominal wall was closed with sterile
catgut (Ethicon No. 4) and treated once more with 1 drop of
antibiotic. The epidermis was sutured with sterile Ethibond Excel
(Ethicon) No. 3/0, and the suture was sprayed with Nebacetin N
(Yamanouchi) wound spray. A tenth of a daily dose of drug/placebo
was given as an i.v. bolus.
Samples and blood were taken for investigating biochemical
parameters in the serum and urine: Na, K, creatinine, protein (only
in urine), on days 1, 2 and 4 of the experiment. In addition, the
feed and water consumption, bodyweight and urine volume were
recorded. After 14 days, the animals were sacrificed and the
kidneys were assessed.

The assessment excluded all animals which died of an infarct
during the experiment or showed an infarct at necropsy on day 14.
The creatinine clearance and the fractional sodium excretion were
calculated as kidney function parameters, comparing treated
animals with control and sham-operated animals.
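
The two kidney function parameters can be illustrated with the
following Python sketch. The formulas are the customary textbook
definitions and are an assumption here, since the description does
not state how the values were computed; the function names and the
numerical inputs are hypothetical and are not measured data.

```python
def creatinine_clearance_ml_min(urine_creat, plasma_creat,
                                urine_volume_ml, collection_min):
    """Creatinine clearance in ml/min.

    urine_creat and plasma_creat must be in the same concentration unit.
    """
    return (urine_creat * urine_volume_ml) / (plasma_creat * collection_min)

def fractional_sodium_excretion_pct(urine_na, plasma_na,
                                    urine_creat, plasma_creat):
    """Fractional sodium excretion in percent of the filtered sodium load."""
    return 100.0 * (urine_na * plasma_creat) / (plasma_na * urine_creat)

# Hypothetical 24 h collection (1440 min), illustrative values only.
clearance = creatinine_clearance_ml_min(
    urine_creat=60.0, plasma_creat=0.5,      # mg/dl
    urine_volume_ml=12.0, collection_min=1440.0,
)
fe_na = fractional_sodium_excretion_pct(
    urine_na=70.0, plasma_na=140.0,          # mmol/l
    urine_creat=60.0, plasma_creat=0.5,      # mg/dl
)
print(clearance, fe_na)
```

Both quantities are ratios of concentrations, so only consistency
of units between urine and plasma matters, not the unit itself.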

d) In vitro model for radiosensitization (tumor therapy) 

MCF-7 cells (human breast carcinoma) were cultivated in
Dulbecco's modified Eagle's medium with 10% heat-inactivated FCS
and 2 mM L-glutamine. Cells were seeded out overnight at cell
densities of 100, 1000 or 10,000 cells per well in a 6-well plate
and then exposed to ionizing radiation with a dose in the range
from 0 to 10 Gy (137Cs, Shepard Mark, model I-68A, dose rate
3.28 Gy/min). 10 days after the irradiation, the experiment was
assessed, counting colonies with fifty cells as positive.
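
A colony count of this kind is commonly evaluated as a surviving
fraction. The following Python sketch shows one customary way of
doing so; it is an assumption, since the description does not
specify the evaluation. In particular, treating "fifty cells" as a
minimum colony size is an interpretation, and all names and
numbers below are hypothetical.

```python
def count_positive_colonies(colony_sizes, min_cells=50):
    """Count colonies that reach the size cut-off (assumed: at least 50 cells)."""
    return sum(1 for size in colony_sizes if size >= min_cells)

def plating_efficiency(colonies, cells_seeded):
    """Fraction of seeded cells that grew into a positive colony."""
    return colonies / cells_seeded

def surviving_fraction(colonies, cells_seeded, control_efficiency):
    """Plating efficiency of an irradiated well relative to the unirradiated control."""
    return plating_efficiency(colonies, cells_seeded) / control_efficiency

# Hypothetical wells: 0 Gy control seeded with 100 cells,
# irradiated well seeded with 1000 cells.
control_pe = plating_efficiency(colonies=60, cells_seeded=100)
sf = surviving_fraction(colonies=30, cells_seeded=1000,
                        control_efficiency=control_pe)
print(control_pe, sf)
```

Seeding more cells at higher doses, as with the 100, 1000 and
10,000 cells per well above, keeps the number of colonies per well
countable across the whole dose range.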

Example 8: Preparation of the PARP inhibitor 2(4(2-(N,N-
diethylamino)eth-1-yloxy)phenyl)benzimidazole-4-carboxamide

a) 4(2-(N,N-Diethylamino)eth-1-yloxy)benzaldehyde

15 g (122 mmol) of 4-hydroxybenzaldehyde, 16.7 g (122 mmol)
of N-(2-chloroethyl)-N,N-diethylamine and 33.9 g (246 mmol)
of potassium carbonate were boiled under reflux together with
a spatula tip of 18-crown-6 in 300 ml of ethyl methyl ketone
for 6 hours. After filtration, the filtrate was concentrated
in vacuo. The residue was partitioned between ether and 2M
sodium hydroxide solution, and the ether phase was separated
off, dried and concentrated in vacuo. 24.8 g of the
intermediate were obtained.
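
The molar amounts quoted in stage a can be cross-checked with a
short Python sketch. The elemental compositions, the rounded
atomic masses and the yield estimate are derived here for
illustration and are not taken from the description; the amine is
assumed to be used as the free base.

```python
# Rounded standard atomic masses in g/mol (assumed values for this sketch).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007,
               "O": 15.999, "Cl": 35.45, "K": 39.098}

def molar_mass(composition):
    """Molar mass in g/mol from a dict of element -> atom count."""
    return sum(ATOMIC_MASS[element] * count
               for element, count in composition.items())

def mmol(mass_g, composition):
    """Amount of substance in mmol for a given mass in grams."""
    return 1000.0 * mass_g / molar_mass(composition)

HYDROXYBENZALDEHYDE = {"C": 7, "H": 6, "O": 2}
CHLOROETHYL_DIETHYLAMINE = {"C": 6, "H": 14, "Cl": 1, "N": 1}  # free base assumed
POTASSIUM_CARBONATE = {"K": 2, "C": 1, "O": 3}
STAGE_A_PRODUCT = {"C": 13, "H": 19, "N": 1, "O": 2}  # derived, not from the text

aldehyde_mmol = mmol(15.0, HYDROXYBENZALDEHYDE)
amine_mmol = mmol(16.7, CHLOROETHYL_DIETHYLAMINE)
carbonate_mmol = mmol(33.9, POTASSIUM_CARBONATE)

# Crude yield relative to the aldehyde, assuming the 24.8 g is pure product.
crude_yield = mmol(24.8, STAGE_A_PRODUCT) / aldehyde_mmol
print(aldehyde_mmol, amine_mmol, carbonate_mmol, crude_yield)
```

The computed amounts agree with the quoted 122, 122 and 246 mmol
to within rounding, and the crude yield comes out at roughly 90%
under the stated assumptions.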

b) Ethyl 2(4(2-(N,N-diethylamino)eth-1-yloxy)phenyl)benzimida-
zole-4-carboxylate

2 g (11 mmol) of ethyl 2,3-diaminobenzoate and 1.4 ml of
concentrated acetic acid were dissolved in 25 ml of methanol.
Then 3.2 g (14.4 mmol) of the intermediate from stage a,
dissolved in 50 ml of methanol, were added dropwise over the
course of 30 minutes. Thereafter 2.9 g (14.4 mmol) of
copper(II) acetate, dissolved in 37.5 ml of warm water, were
rapidly added dropwise and then the mixture was boiled under
reflux for 20 minutes. The reaction solution was cooled to
50°C, and 4.5 ml of 32% strength hydrochloric acid were
added. Then a solution of 4.3 g of sodium sulfide hydrate in
25 ml of water was cautiously added dropwise, and the mixture
was stirred for 15 minutes. The reaction solution was poured
into ice-water, and the resulting precipitate was filtered
off with suction. The filtrate was made alkaline with aqueous
sodium bicarbonate solution and extracted several times with
ethyl acetate. The ethyl acetate phase was separated off,
dried and concentrated in vacuo. 4.4 g of the intermediate
were obtained.

c) 2(4(2-(N,N-Diethylamino)eth-1-yloxy)phenyl)benzimidazole-
4-carbohydrazide

2.7 g (54 mmol) of hydrazine hydrate were added to 4.1 g
(10.7 mmol) of the intermediate from stage b in 30 ml of
ethanol, and the mixture was boiled under reflux for 10
hours. The organic solvent was then removed in vacuo, and the
residue was partitioned between water and ethyl acetate. The
ethyl acetate phase was separated off, dried and concentrated
in vacuo. The resulting residue was then treated with ether
and again filtered with suction, resulting in 1.7 g of the
intermediate.



d) 2(4(2-(N,N-Diethylamino)eth-1-yloxy)phenyl)benzimidazole-
4-carboxamide

About 1.6 g of Raney nickel were added to 1.6 g (4.5 mmol) of
the intermediate from stage c in 45 ml of dimethylformamide/
water (2/1), and the mixture was heated at 100°C for 6 hours.
The reaction mixture was then filtered, and the filtrate was
diluted with a large amount of water, whereupon the product
precipitated. 1.2 g of the product were obtained.

1H-NMR (D6-DMSO) δ = 0.95 (6H), 2.6 (4H), 2.8 (2H), 4.1 (2H),
7.1 (2H), 7.3 (1H), 7.7 (1H + NH), 7.85 (1H), 8.2 (2H) and
9.4 (NH) ppm.

Example 9: Preparation of the PARP inhibitor 2(3(2-(N,N-
diethylamino)eth-1-yloxy)phenyl)benzimidazole-4-carboxamide

a) 3(2-(N,N-Diethylamino)eth-1-yloxy)benzaldehyde

6.1 g (50 mmol) of 3-hydroxybenzaldehyde were dissolved in
100 ml of ethanol, and 3.5 g (50 mmol) of sodium ethanolate
were added. The mixture was stirred for 15 minutes. Then
7.5 g (55 mmol) of N-(2-chloroethyl)-N,N-diethylamine were
added, and the mixture was boiled under reflux for 12 hours.
The reaction mixture was then concentrated in vacuo. The
residue was partitioned between ether and 1N sodium hydroxide
solution, and the ether phase was separated off, dried and
concentrated in vacuo. 7.6 g of the intermediate were
obtained.

b) Ethyl 2(3(2-(N,N-diethylamino)eth-1-yloxy)phenyl)benzimida-
zole-4-carboxylate

1 g (5.5 mmol) of ethyl 2,3-diaminobenzoate and 0.68 ml of
concentrated acetic acid were dissolved in 20 ml of methanol.
Then 1.6 g (7.2 mmol) of the intermediate from stage a,
dissolved in 30 ml of methanol, were added dropwise over the
course of 30 minutes. Thereafter 1.1 g (5.5 mmol) of
copper(II) acetate, dissolved in 19 ml of warm water, were
rapidly added dropwise, and the mixture was then boiled under
reflux for 20 minutes. The reaction solution was cooled to
50°C, and 2.25 ml of 32% strength hydrochloric acid were
added. Then a solution of 2.13 g of sodium sulfide hydrate in
15 ml of water was cautiously added dropwise, and the mixture
was stirred for 15 minutes. The reaction solution was poured
into ice-water, and the resulting precipitate was filtered
off with suction. The filtrate was made alkaline with aqueous
sodium bicarbonate solution and extracted several times with
ethyl acetate. The ethyl acetate phase was separated off,
dried and concentrated in vacuo. 2.4 g of the intermediate
were obtained.

c) 2(3(2-(N,N-Diethylamino)eth-1-yloxy)phenyl)benzimidazole-
4-carbohydrazide

1.5 g (30 mmol) of hydrazine hydrate were added to 2.3 g
(6.0 mmol) of the intermediate from stage b in 30 ml of
butanol, and the mixture was heated at 120°C for 10 hours. The
reaction mixture was then extracted with ethyl acetate. The
ethyl acetate phase was separated off, dried and concentrated
in vacuo. 1.7 g of the intermediate were obtained.

d) 2(3(2-(N,N-Diethylamino)eth-1-yloxy)phenyl)benzimidazole-
4-carboxamide

About 1.5 g of Raney nickel were added to 1 g (2.7 mmol) of
the intermediate from stage c in 30 ml of dimethylformamide/
water (2/1), and the mixture was heated at 100°C for 6 hours.
The reaction mixture was then filtered, and the filtrate was
diluted with a large amount of water, whereupon the product
precipitated. 0.74 g of the product was obtained.

1H-NMR (D6-DMSO) δ = 1.0 (6H), 2.6 (4H), 2.9 (2H), 4.15 (2H),
7.1 (1H), 7.4 (1H), 7.5 (1H), 7.7-7.9 (5H) and 9.3 (NH) ppm.



SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: BASF Aktiengesellschaft

(B) STREET: - 

(C) CITY: Ludwigshafen

(E) COUNTRY: Germany 

(F) POSTAL CODE: 67065 

(ii) TITLE OF INVENTION: Novel poly(ADP-ribose) polymerase genes
(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER-READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: brain 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..1715

(D) OTHER INFORMATION: /product= "Poly ADP Ribose 
Polymerase" 
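
For orientation, a CDS location such as 3..1715 can be read as a
1-based, inclusive nucleotide range. The following Python sketch
counts the codons in such a range and translates it with the
standard genetic code. The function names are hypothetical, and
the test fragment is only the first 29 nucleotides of SEQ ID NO: 1
as listed below.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}

def codon_count(start, end):
    """Number of codons in a 1-based, inclusive CDS location."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3

def translate_cds(sequence, start, end):
    """Translate the CDS at start..end into one-letter amino acid codes."""
    cleaned = "".join(sequence.split()).upper()
    cds = cleaned[start - 1:end]
    codon_count(start, end)  # validates the reading frame
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 29 nucleotides of SEQ ID NO: 1 (the CDS starts at position 3).
fragment = "CC ATG GCG GCG CGG CGG CGA CGG AGC ACC"
print(translate_cds(fragment, 3, 29))
print(codon_count(3, 1715))
```

The location 3..1715 spans 571 codons including the stop codon,
which matches the length of 571 given for SEQ ID NO: 2, where the
stop is listed as "*".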
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CC ATG GCG GCG CGG CGG CGA CGG AGC ACC GGC GGC GGC AGG GCG AGA 
Met Ala Ala Arg Arg Arg Arg Ser Thr Gly Gly Gly Arg Ala Arg 
1 5 10 15

GCA TTA AAT GAA AGC AAA AGA GTT AAT AAT GGC AAC ACG GCT CCA GAA 
Ala Leu Asn Glu Ser Lys Arg Val Asn Asn Gly Asn Thr Ala Pro Glu 

20 25 30 

GAC TCT TCC CCT GCC AAG AAA ACT CGT AGA TGC CAG AGA CAG GAG TCG 
Asp Ser Ser Pro Ala Lys Lys Thr Arg Arg Cys Gln Arg Gln Glu Ser
35 40 45 

AAA AAG ATG CCT GTG GCT GGA GGA AAA GCT AAT AAG GAC AGG ACA GAA 
Lys Lys Met Pro Val Ala Gly Gly Lys Ala Asn Lys Asp Arg Thr Glu
50 55 60 

GAC AAG CAA GAT GAA TCT GTG AAG GCC TTG CTG TTA AAG GGC AAA GCT 
Asp Lys Gln Asp Glu Ser Val Lys Ala Leu Leu Leu Lys Gly Lys Ala
65 70 75 

CCT GTG GAC CCA GAG TGT ACA GCC AAG GTG GGG AAG GCT CAT GTG TAT 
Pro Val Asp Pro Glu Cys Thr Ala Lys Val Gly Lys Ala His Val Tyr 
80 85 90 95 

TGT GAA GGA AAT GAT GTC TAT GAT GTC ATG CTA AAT CAG ACC AAT CTC 
Cys Glu Gly Asn Asp Val Tyr Asp Val Met Leu Asn Gln Thr Asn Leu

100 105 110 

CAG TTC AAC AAC AAC AAG TAC TAT CTG ATT CAG CTA TTA GAA GAT GAT 
Gln Phe Asn Asn Asn Lys Tyr Tyr Leu Ile Gln Leu Leu Glu Asp Asp
115 120 125 

GCC CAG AGG AAC TTC AGT GTT TGG ATG AGA TGG GGC CGA GTT GGG AAA 
Ala Gln Arg Asn Phe Ser Val Trp Met Arg Trp Gly Arg Val Gly Lys
130 135 140 

ATG GGA CAG CAC AGC CTG GTG GCT TGT TCA GGC AAT CTC AAC AAG GCC 
Met Gly Gln His Ser Leu Val Ala Cys Ser Gly Asn Leu Asn Lys Ala
145 150 155 

AAG GAA ATC TTT CAG AAG AAA TTC CTT GAC AAA ACG AAA AAC AAT TGG 
Lys Glu Ile Phe Gln Lys Lys Phe Leu Asp Lys Thr Lys Asn Asn Trp
160 165 170 175 



GAA GAT CGA GAA AAG TTT GAG AAG GTG CCT GGA AAA TAT GAT ATG CTA 
Glu Asp Arg Glu Lys Phe Glu Lys Val Pro Gly Lys Tyr Asp Met Leu

180 185 190 

CAG ATG GAC TAT GCC ACC AAT ACT CAG GAT GAA GAG GAA ACA AAG AAA 
Gln Met Asp Tyr Ala Thr Asn Thr Gln Asp Glu Glu Glu Thr Lys Lys
195 200 205 

GAG GAA TCT CTT AAA TCT CCC TTG AAG CCA GAG TCA CAG CTA GAT CTT 
Glu Glu Ser Leu Lys Ser Pro Leu Lys Pro Glu Ser Gln Leu Asp Leu
210 215 220 

CGG GTA CAG GAG TTA ATA AAG TTG ATC TGT AAT GTT CAG GCC ATG GAA 
Arg Val Gln Glu Leu Ile Lys Leu Ile Cys Asn Val Gln Ala Met Glu
225 230 235 

GAA ATG ATG ATG GAA ATG AAG TAT AAT ACC AAG AAA GCC CCA CTT GGG 
Glu Met Met Met Glu Met Lys Tyr Asn Thr Lys Lys Ala Pro Leu Gly 
240 245 250 255 

AAG CTG ACA GTG GCA CAA ATC AAG GCA GGT TAC CAG TCT CTT AAG AAG 
Lys Leu Thr Val Ala Gln Ile Lys Ala Gly Tyr Gln Ser Leu Lys Lys

260 265 270 

ATT GAG GAT TGT ATT CGG GCT GGC CAG CAT GGA CGA GCT CTC ATG GAA 
Ile Glu Asp Cys Ile Arg Ala Gly Gln His Gly Arg Ala Leu Met Glu
275 280 285 

GCA TGC AAT GAA TTC TAC ACC AGG ATT CCG CAT GAC TTT GGA CTC CGT 
Ala Cys Asn Glu Phe Tyr Thr Arg Ile Pro His Asp Phe Gly Leu Arg
290 295 300 

ACT CCT CCA CTA ATC CGG ACA CAG AAG GAA CTG TCA GAA AAA ATA CAA 
Thr Pro Pro Leu Ile Arg Thr Gln Lys Glu Leu Ser Glu Lys Ile Gln
305 310 315 

TTA CTA GAG GCT TTG GGA GAC ATT GAA ATT GCT ATT AAG CTG GTG AAA 
Leu Leu Glu Ala Leu Gly Asp Ile Glu Ile Ala Ile Lys Leu Val Lys
320 325 330 335 

ACA GAG CTA CAA AGC CCA GAA CAC CCA TTG GAC CAA CAC TAT AGA AAC 
Thr Glu Leu Gln Ser Pro Glu His Pro Leu Asp Gln His Tyr Arg Asn

340 345 350 

CTA CAT TGT GCC TTG CGC CCC CTT GAC CAT GAA AGT TAC GAG TTC AAA 
Leu His Cys Ala Leu Arg Pro Leu Asp His Glu Ser Tyr Glu Phe Lys 
355 360 365 
GTG ATT TCC CAG TAC CTA CAA TCT ACC CAT GCT CCC ACA CAC AGC GAC 1151

Val Ile Ser Gln Tyr Leu Gln Ser Thr His Ala Pro Thr His Ser Asp
370 375 380 

TAT ACC ATG ACC TTG CTG GAT TTG TTT GAA GTG GAG AAG GAT GGT GAG 1199 

Tyr Thr Met Thr Leu Leu Asp Leu Phe Glu Val Glu Lys Asp Gly Glu 
385 390 395 

AAA GAA GCC TTC AGA GAG GAC CTT CAT AAC AGG ATG CTT CTA TGG CAT 1247 

Lys Glu Ala Phe Arg Glu Asp Leu His Asn Arg Met Leu Leu Trp His 
400 405 410 415 

GGT TCC AGG ATG AGT AAC TGG GTG GGA ATC TTG AGC CAT GGG CTT CGA 1295 

Gly Ser Arg Met Ser Asn Trp Val Gly Ile Leu Ser His Gly Leu Arg

420 425 430 

ATT GCC CCA CCT GAA GCT CCC ATC ACA GGT TAC ATG TTT GGG AAA GGA 1343 

Ile Ala Pro Pro Glu Ala Pro Ile Thr Gly Tyr Met Phe Gly Lys Gly

435 440 445 

ATC TAC TTT GCT GAC ATG TCT TCC AAG AGT GCC AAT TAC TGC TTT GCC 1391 

Ile Tyr Phe Ala Asp Met Ser Ser Lys Ser Ala Asn Tyr Cys Phe Ala
450 455 460 

TCT CGC CTA AAG AAT ACA GGA CTG CTG CTC TTA TCA GAG GTA GCT CTA 1439 

Ser Arg Leu Lys Asn Thr Gly Leu Leu Leu Leu Ser Glu Val Ala Leu 
465 470 475 

GGT CAG TGT AAT GAA CTA CTA GAG GCC AAT CCT AAG GCC GAA GGA TTG 1487 

Gly Gln Cys Asn Glu Leu Leu Glu Ala Asn Pro Lys Ala Glu Gly Leu
480 485 490 495 

CTT CAA GGT AAA CAT AGC ACC AAG GGG CTG GGC AAG ATG GCT CCC AGT 1535 

Leu Gln Gly Lys His Ser Thr Lys Gly Leu Gly Lys Met Ala Pro Ser

500 505 510 

TCT GCC CAC TTC GTC ACC CTG AAT GGG AGT ACA GTG CCA TTA GGA CCA 1583 

Ser Ala His Phe Val Thr Leu Asn Gly Ser Thr Val Pro Leu Gly Pro 

515 520 525 

GCA AGT GAC ACA GGA ATT CTG AAT CCA GAT GGT TAT ACC CTC AAC TAC 1631 

Ala Ser Asp Thr Gly Ile Leu Asn Pro Asp Gly Tyr Thr Leu Asn Tyr
530 535 540 

AAT GAA TAT ATT GTA TAT AAC CCC AAC CAG GTC CGT ATG CGG TAC CTT 1679

Asn Glu Tyr Ile Val Tyr Asn Pro Asn Gln Val Arg Met Arg Tyr Leu
545 550 555

TTA AAG GTT CAG TTT AAT TTC CTT CAG CTG TGG TGA ATGTTGATAT 1725 
Leu Lys Val Gln Phe Asn Phe Leu Gln Leu Trp *
560 565 570 

TAAATAAACC AGAGATCTGA TCTTCAAGCA AGAAAATAAG CAGTGTTGTA CTTGTGAATT 1785

TTGTGATATT TTATGTAATA AAAACTGTAC AGGTCTAAAA AAAAAAAAAA AAAAAAAA 1843
(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Ala Arg Arg Arg Arg Ser Thr Gly Gly Gly Arg Ala Arg Ala 
1 5 10 15 

Leu Asn Glu Ser Lys Arg Val Asn Asn Gly Asn Thr Ala Pro Glu Asp 
20 25 30 

Ser Ser Pro Ala Lys Lys Thr Arg Arg Cys Gln Arg Gln Glu Ser Lys
35 40 45 

Lys Met Pro Val Ala Gly Gly Lys Ala Asn Lys Asp Arg Thr Glu Asp 
50 55 60

Lys Gln Asp Glu Ser Val Lys Ala Leu Leu Leu Lys Gly Lys Ala Pro
65 70 75 80 

Val Asp Pro Glu Cys Thr Ala Lys Val Gly Lys Ala His Val Tyr Cys 

85 90 95 

Glu Gly Asn Asp Val Tyr Asp Val Met Leu Asn Gln Thr Asn Leu Gln
100 105 110 



Phe Asn Asn Asn Lys Tyr Tyr Leu Ile Gln Leu Leu Glu Asp Asp Ala
115 120 125 



Gln Arg Asn Phe Ser Val Trp Met Arg Trp Gly Arg Val Gly Lys Met
130 135 140 
Gly Gln His Ser Leu Val Ala Cys Ser Gly Asn Leu Asn Lys Ala Lys
145 150 155 160 

Glu Ile Phe Gln Lys Lys Phe Leu Asp Lys Thr Lys Asn Asn Trp Glu

165 170 175 

Asp Arg Glu Lys Phe Glu Lys Val Pro Gly Lys Tyr Asp Met Leu Gln
180 185 190 

Met Asp Tyr Ala Thr Asn Thr Gln Asp Glu Glu Glu Thr Lys Lys Glu
195 200 205 

Glu Ser Leu Lys Ser Pro Leu Lys Pro Glu Ser Gln Leu Asp Leu Arg
210 215 220 

Val Gln Glu Leu Ile Lys Leu Ile Cys Asn Val Gln Ala Met Glu Glu
225 230 235 240 

Met Met Met Glu Met Lys Tyr Asn Thr Lys Lys Ala Pro Leu Gly Lys 

245 250 255 

Leu Thr Val Ala Gln Ile Lys Ala Gly Tyr Gln Ser Leu Lys Lys Ile
260 265 270 

Glu Asp Cys Ile Arg Ala Gly Gln His Gly Arg Ala Leu Met Glu Ala
275 280 285 

Cys Asn Glu Phe Tyr Thr Arg Ile Pro His Asp Phe Gly Leu Arg Thr
290 295 300 



Pro Pro Leu Ile Arg Thr Gln Lys Glu Leu Ser Glu Lys Ile Gln Leu
305 310 315 320

Leu Glu Ala Leu Gly Asp Ile Glu Ile Ala Ile Lys Leu Val Lys Thr
325 330 335

Glu Leu Gln Ser Pro Glu His Pro Leu Asp Gln His Tyr Arg Asn Leu
340 345 350

His Cys Ala Leu Arg Pro Leu Asp His Glu Ser Tyr Glu Phe Lys Val
355 360 365

Ile Ser Gln Tyr Leu Gln Ser Thr His Ala Pro Thr His Ser Asp Tyr
370 375 380

Thr Met Thr Leu Leu Asp Leu Phe Glu Val Glu Lys Asp Gly Glu Lys
385 390 395 400

Glu Ala Phe Arg Glu Asp Leu His Asn Arg Met Leu Leu Trp His Gly 

405 410 415 

Ser Arg Met Ser Asn Trp Val Gly Ile Leu Ser His Gly Leu Arg Ile
420 425 430 

Ala Pro Pro Glu Ala Pro Ile Thr Gly Tyr Met Phe Gly Lys Gly Ile
435 440 445 

Tyr Phe Ala Asp Met Ser Ser Lys Ser Ala Asn Tyr Cys Phe Ala Ser 
450 455 460 

Arg Leu Lys Asn Thr Gly Leu Leu Leu Leu Ser Glu Val Ala Leu Gly 
465 470 475 480 

Gln Cys Asn Glu Leu Leu Glu Ala Asn Pro Lys Ala Glu Gly Leu Leu

485 490 495 

Gln Gly Lys His Ser Thr Lys Gly Leu Gly Lys Met Ala Pro Ser Ser
500 505 510 

Ala His Phe Val Thr Leu Asn Gly Ser Thr Val Pro Leu Gly Pro Ala 
515 520 525 

Ser Asp Thr Gly Ile Leu Asn Pro Asp Gly Tyr Thr Leu Asn Tyr Asn
530 535 540 

Glu Tyr Ile Val Tyr Asn Pro Asn Gln Val Arg Met Arg Tyr Leu Leu
545 550 555 560 

Lys Val Gln Phe Asn Phe Leu Gln Leu Trp *

565 570 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2265 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: uterus 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 242..1843

(D) OTHER INFORMATION: /product= "Poly ADP Ribose
Polymerase" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TGGGACTGGT CGCCTGACTC GGCCTGCCCC AGCCTCTGCT TCACCCCACT GGTGGCCAAA 60 

TAGCCGATGT CTAATCCCCC ACACAAGCTC ATCCCCGGCC TCTGGGATTG TTGGGAATTC 120

TCTCCCTAAT TCACGCCTGA GGCTCATGGA GAGTTGCTAG ACCTGGGACT GCCCTGGGAG 180

GCGCACACAA CCAGGCCGGG TGGCAGCCAG GACCTCTCCC ATGTCCCTGC TTTTCTTGGC 240

C ATG GCT CCA AAG CCG AAG CCC TGG GTA CAG ACT GAG GGC CCT GAG 286 
Met Ala Pro Lys Pro Lys Pro Trp Val Gln Thr Glu Gly Pro Glu
575 580 585 

AAG AAG AAG GGC CGG CAG GCA GGA AGG GAG GAG GAC CCC TTC CGC TCC 334 
Lys Lys Lys Gly Arg Gln Ala Gly Arg Glu Glu Asp Pro Phe Arg Ser
590 595 600 

ACC GCT GAG GCC CTC AAG GCC ATA CCC GCA GAG AAG CGC ATA ATC CGC 382 
Thr Ala Glu Ala Leu Lys Ala Ile Pro Ala Glu Lys Arg Ile Ile Arg
605 610 615 

GTG GAT CCA ACA TGT CCA CTC AGC AGC AAC CCC GGG ACC CAG GTG TAT 430

Val Asp Pro Thr Cys Pro Leu Ser Ser Asn Pro Gly Thr Gln Val Tyr
620 625 630 

GAG GAC TAC AAC TGC ACC CTG AAC CAG ACC AAC ATC GAG AAC AAC AAC 478 
Glu Asp Tyr Asn Cys Thr Leu Asn Gln Thr Asn Ile Glu Asn Asn Asn
635 640 645 650 

AAC AAG TTC TAC ATC ATC CAG CTG CTC CAA GAC AGC AAC CGC TTC TTC 526 
Asn Lys Phe Tyr Ile Ile Gln Leu Leu Gln Asp Ser Asn Arg Phe Phe

655 660 665 



ACC TGC TGG AAC CGC TGG GGC CGT GTG GGA GAG GTC GGC CAG TCA AAG 574
Thr Cys Trp Asn Arg Trp Gly Arg Val Gly Glu Val Gly Gln Ser Lys
670 675 680

ATC AAC CAC TTC ACA AGG CTA GAA GAT GCA AAG AAG GAC TTT GAG AAG 622
Ile Asn His Phe Thr Arg Leu Glu Asp Ala Lys Lys Asp Phe Glu Lys
685 690 695 

AAA TTT CGG GAA AAG ACC AAG AAC AAC TGG GCA GAG CGG GAC CAC TTT 670

Lys Phe Arg Glu Lys Thr Lys Asn Asn Trp Ala Glu Arg Asp His Phe 
700 705 710 

GTG TCT CAC CCG GGC AAG TAC ACA CTT ATC GAA GTA CAG GCA GAG GAT 718 
Val Ser His Pro Gly Lys Tyr Thr Leu Ile Glu Val Gln Ala Glu Asp
715 720 725 730 

GAG GCC CAG GAA GCT GTG GTG AAG GTG GAC AGA GGC CCA GTG AGG ACT 766 
Glu Ala Gln Glu Ala Val Val Lys Val Asp Arg Gly Pro Val Arg Thr

735 740 745 

GTG ACT AAG CGG GTG CAG CCC TGC TCC CTG GAC CCA GCC ACG CAG AAG 814 
Val Thr Lys Arg Val Gln Pro Cys Ser Leu Asp Pro Ala Thr Gln Lys
750 755 760 

CTC ATC ACT AAC ATC TTC AGC AAG GAG ATG TTC AAG AAC ACC ATG GCC 862 
Leu Ile Thr Asn Ile Phe Ser Lys Glu Met Phe Lys Asn Thr Met Ala
765 770 775 

CTC ATG GAC CTG GAT GTG AAG AAG ATG CCC CTG GGA AAG CTG AGC AAG 910 
Leu Met Asp Leu Asp Val Lys Lys Met Pro Leu Gly Lys Leu Ser Lys 
780 785 790 

CAA CAG ATT GCA CGG GGT TTC GAG GCC TTG GAG GCG CTG GAG GAG GCC 958 
Gln Gln Ile Ala Arg Gly Phe Glu Ala Leu Glu Ala Leu Glu Glu Ala
795 800 805 810

CTG AAA GGC CCC ACG GAT GGT GGC CAA AGC CTG GAG GAG CTG TCC TCA 1006 
Leu Lys Gly Pro Thr Asp Gly Gly Gln Ser Leu Glu Glu Leu Ser Ser

815 820 825 

CAC TTT TAC ACC GTC ATC CCG CAC AAC TTC GGC CAC AGC CAG CCC CCG 1054 
His Phe Tyr Thr Val Ile Pro His Asn Phe Gly His Ser Gln Pro Pro
830 835 840 

CCC ATC AAT TCC CCT GAG CTT CTG CAG GCC AAG AAG GAC ATG CTG CTG 1102 
Pro Ile Asn Ser Pro Glu Leu Leu Gln Ala Lys Lys Asp Met Leu Leu
845 850 855 

GTG CTG GCG GAC ATC GAG CTG GCC CAG GCC CTG CAG GCA GTC TCT GAG 1150 
Val Leu Ala Asp Ile Glu Leu Ala Gln Ala Leu Gln Ala Val Ser Glu
860 865 870 

CAG GAG AAG ACG GTG GAG GAG GTG CCA CAC CCC CTG GAC CGA GAC TAC 1198 
Gln Glu Lys Thr Val Glu Glu Val Pro His Pro Leu Asp Arg Asp Tyr
875 880 885 890 

CAG CTT CTC AAG TGC CAG CTG CAG CTG CTA GAC TCT GGA GCA CCT GAG 1246 
Gln Leu Leu Lys Cys Gln Leu Gln Leu Leu Asp Ser Gly Ala Pro Glu

895 900 905 

TAC AAG GTG ATA CAG ACC TAC TTA GAA CAG ACT GGC AGC AAC CAC AGG 1294 
Tyr Lys Val Ile Gln Thr Tyr Leu Glu Gln Thr Gly Ser Asn His Arg
910 915 920 

TGC CCT ACA CTT CAA CAC ATC TGG AAA GTA AAC CAA GAA GGG GAG GAA 1342 
Cys Pro Thr Leu Gln His Ile Trp Lys Val Asn Gln Glu Gly Glu Glu
925 930 935 

GAC AGA TTC CAG GCC CAC TCC AAA CTG GGT AAT CGG AAG CTG CTG TGG 1390 
Asp Arg Phe Gln Ala His Ser Lys Leu Gly Asn Arg Lys Leu Leu Trp
940 945 950 

CAT GGC ACC AAC ATG GCC GTG GTG GCC GCC ATC CTC ACT AGT GGG CTC 1438 
His Gly Thr Asn Met Ala Val Val Ala Ala Ile Leu Thr Ser Gly Leu
955 960 965 970 

CGC ATC ATG CCA CAT TCT GGT GGG CGT GTT GGC AAG GGC ATC TAC TTT 1486 
Arg Ile Met Pro His Ser Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe

975 980 985 

GCC TCA GAG AAC AGC AAG TCA GCT GGA TAT GTT ATT GGC ATG AAG TGT 1534 
Ala Ser Glu Asn Ser Lys Ser Ala Gly Tyr Val Ile Gly Met Lys Cys
990 995 1000 

GGG GCC CAC CAT GTC GGC TAC ATG TTC CTG GGT GAG GTG GCC CTG GGC 1582 
Gly Ala His His Val Gly Tyr Met Phe Leu Gly Glu Val Ala Leu Gly 
1005 1010 1015 

AGA GAG CAC CAT ATC AAC ACG GAC AAC CCC AGC TTG AAG AGC CCA CCT 1630 
Arg Glu His His Ile Asn Thr Asp Asn Pro Ser Leu Lys Ser Pro Pro
1020 1025 1030 

CCT GGC TTC GAC AGT GTC ATT GCC CGA GGC CAC ACC GAG CCT GAT CCG 1678 
Pro Gly Phe Asp Ser Val Ile Ala Arg Gly His Thr Glu Pro Asp Pro
1035 1040 1045 1050 
ACC CAG GAC ACT GAG TTG GAG CTG GAT GGC CAG CAA GTG GTG GTG CCC 1726
Thr Gln Asp Thr Glu Leu Glu Leu Asp Gly Gln Gln Val Val Val Pro

1055 1060 1065 

CAG GGC CAG CCT GTG CCC TGC CCA GAG TTC AGC AGC TCC ACA TTC TCC 1774
Gln Gly Gln Pro Val Pro Cys Pro Glu Phe Ser Ser Ser Thr Phe Ser
1070 1075 1080 

CAG AGC GAG TAC CTC ATC TAC CAG GAG AGC CAG TGT CGC CTG CGC TAC 1822 
Gln Ser Glu Tyr Leu Ile Tyr Gln Glu Ser Gln Cys Arg Leu Arg Tyr
1085 1090 1095 

CTG CTG GAG GTC CAC CTC TGA GTGCCCGCCC TGTCCCCCGG GGTCCTGCAA 1873 
Leu Leu Glu Val His Leu * 

1100 1105 

GGCTGGACTG TGATCTTCAA TCATCCTGCC CATCTCTGGT ACCCCTATAT CACTCCTTTT 1933 

TTTCAAGAAT ACAATACGTT GTTGTTAACT ATAGTCACCA TGCTGTACAA GATCCCTGAA 1993 

CTTATGCCTC CTAACTGAAA TTTTGTATTC TTTGACACAT CTGCCCAGTC CCTCTCCTCC 2053

CAGCCCATGG TAACCAGCAT TTGACTCTTT ACTTGTATAA GGGCAGCTTT TATAGGTTCC 2113

ACATGTAAGT GAGATCATGC AGTGTTTGTC TTTCTGTGCC TGGCTTATTT CACTCAGCAT 2173

AATGTGCACC GGGTTCACCC ATGTTTTCAT AAATGACAAG ATTTCCTCCT TTAAAAAAAA 2233 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2265 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Pro Lys Pro Lys Pro Trp Val Gln Thr Glu Gly Pro Glu Lys
1 5 10 15

Lys Lys Gly Arg Gln Ala Gly Arg Glu Glu Asp Pro Phe Arg Ser Thr
20 25 30 

Ala Glu Ala Leu Lys Ala Ile Pro Ala Glu Lys Arg Ile Ile Arg Val
35 40 45

Asp Pro Thr Cys Pro Leu Ser Ser Asn Pro Gly Thr Gln Val Tyr Glu
50 55 60

Asp Tyr Asn Cys Thr Leu Asn Gln Thr Asn Ile Glu Asn Asn Asn Asn
65 70 75 80

Lys Phe Tyr Ile Ile Gln Leu Leu Gln Asp Ser Asn Arg Phe Phe Thr
85 90 95

Cys Trp Asn Arg Trp Gly Arg Val Gly Glu Val Gly Gln Ser Lys Ile
100 105 110

Asn His Phe Thr Arg Leu Glu Asp Ala Lys Lys Asp Phe Glu Lys Lys
115 120 125

Phe Arg Glu Lys Thr Lys Asn Asn Trp Ala Glu Arg Asp His Phe Val
130 135 140

Ser His Pro Gly Lys Tyr Thr Leu Ile Glu Val Gln Ala Glu Asp Glu
145 150 155 160

Ala Gln Glu Ala Val Val Lys Val Asp Arg Gly Pro Val Arg Thr Val
165 170 175

Thr Lys Arg Val Gln Pro Cys Ser Leu Asp Pro Ala Thr Gln Lys Leu
180 185 190

Ile Thr Asn Ile Phe Ser Lys Glu Met Phe Lys Asn Thr Met Ala Leu
195 200 205

Met Asp Leu Asp Val Lys Lys Met Pro Leu Gly Lys Leu Ser Lys Gln
210 215 220

Gln Ile Ala Arg Gly Phe Glu Ala Leu Glu Ala Leu Glu Glu Ala Leu
225 230 235 240

Lys Gly Pro Thr Asp Gly Gly Gln Ser Leu Glu Glu Leu Ser Ser His
245 250 255

Phe Tyr Thr Val Ile Pro His Asn Phe Gly His Ser Gln Pro Pro Pro
260 265 270

Ile Asn Ser Pro Glu Leu Leu Gln Ala Lys Lys Asp Met Leu Leu Val
275 280 285

Leu Ala Asp Ile Glu Leu Ala Gln Ala Leu Gln Ala Val Ser Glu Gln
290 295 300

Glu Lys Thr Val Glu Glu Val Pro His Pro Leu Asp Arg Asp Tyr Gln
305 310 315 320 

Leu Leu Lys Cys Gln Leu Gln Leu Leu Asp Ser Gly Ala Pro Glu Tyr

325 330 335 

Lys Val Ile Gln Thr Tyr Leu Glu Gln Thr Gly Ser Asn His Arg Cys
340 345 350 

Pro Thr Leu Gln His Ile Trp Lys Val Asn Gln Glu Gly Glu Glu Asp
355 360 365 

Arg Phe Gln Ala His Ser Lys Leu Gly Asn Arg Lys Leu Leu Trp His
370 375 380 

Gly Thr Asn Met Ala Val Val Ala Ala Ile Leu Thr Ser Gly Leu Arg
385 390 395 400 

Ile Met Pro His Ser Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe Ala

405 410 415 

Ser Glu Asn Ser Lys Ser Ala Gly Tyr Val Ile Gly Met Lys Cys Gly
420 425 430 

Ala His His Val Gly Tyr Met Phe Leu Gly Glu Val Ala Leu Gly Arg 
435 440 445 

Glu His His Ile Asn Thr Asp Asn Pro Ser Leu Lys Ser Pro Pro Pro
450 455 460 

Gly Phe Asp Ser Val Ile Ala Arg Gly His Thr Glu Pro Asp Pro Thr
465 470 475 480 

Gln Asp Thr Glu Leu Glu Leu Asp Gly Gln Gln Val Val Val Pro Gln

485 490 495 

Gly Gln Pro Val Pro Cys Pro Glu Phe Ser Ser Ser Thr Phe Ser Gln
500 505 510 

Ser Glu Tyr Leu Ile Tyr Gln Glu Ser Gln Cys Arg Leu Arg Tyr Leu
515 520 525 



Leu Glu Val His Leu * 
530 




(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2265 base pairs

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(vi) ORIGINAL SOURCE: 

(F) TISSUE TYPE: uterus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 221..1843

(D) OTHER INFORMATION: /product= "Poly ADP Ribose 
Polymerase" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



TGGGACTGGT CGCCTGACTC GGCCTGCCCC AGCCTCTGCT TCACCCCACT GGTGGCCAAA 60

TAGCCGATGT CTAATCCCCC ACACAAGCTC ATCCCCGGCC TCTGGGATTG TTGGGAATTC 120

TCTCCCTAAT TCACGCCTGA GGCTCATGGA GAGTTGCTAG ACCTGGGACT GCCCTGGGAG 180

GCGCACACAA CCAGGCCGGG TGGCAGCCAG GACCTCTCCC ATG TCC CTG CTT TTC 235

Met Ser Leu Leu Phe 
535 



TTG GCC ATG GCT CCA AAG CCG AAG CCC TGG GTA CAG ACT GAG GGC CCT 283

Leu Ala Met Ala Pro Lys Pro Lys Pro Trp Val Gln Thr Glu Gly Pro
540 545 550 555 

GAG AAG AAG AAG GGC CGG CAG GCA GGA AGG GAG GAG GAC CCC TTC CGC 331 
Glu Lys Lys Lys Gly Arg Gln Ala Gly Arg Glu Glu Asp Pro Phe Arg

560 565 570 



TCC ACC GCT GAG GCC CTC AAG GCC ATA CCC GCA GAG AAG CGC ATA ATC 
Ser Thr Ala Glu Ala Leu Lys Ala Ile Pro Ala Glu Lys Arg Ile Ile
575 580 585 
CGC GTG GAT CCA ACA TGT CCA CTC AGC AGC AAC CCC GGG ACC CAG GTG
Arg Val Asp Pro Thr Cys Pro Leu Ser Ser Asn Pro Gly Thr Gln Val
590 595 600 

TAT GAG GAC TAC AAC TGC ACC CTG AAC CAG ACC AAC ATC GAG AAC AAC 
Tyr Glu Asp Tyr Asn Cys Thr Leu Asn Gin Thr Asn lie Glu Asn Asn 
605 610 615 

AAC AAC AAG TTC TAC ATC ATC CAG CTG CTC CAA GAC AGC AAC CGC TTC 
Asn Asn Lys Phe Tyr lie lie Gin Leu Leu Gin Asp Ser Asn Arg Phe 
620 625 630 635 

TTC ACC TGC TGG AAC CGC TGG GGC CGT GTG GGA GAG GTC GGC CAG TCA 
Phe Thr Cys Trp Asn Arg Trp Gly Arg Val Gly Glu Val Gly Gin Ser 

640 645 650 

AAG ATC AAC CAC TTC ACA AGG CTA GAA GAT GCA AAG AAG GAC TTT GAG 
Lys lie Asn His Phe Thr Arg Leu Glu Asp Ala Lys Lys Asp Phe Glu 
655 660 665 

AAG AAA TTT CGG GAA AAG ACC AAG AAC AAC TGG GCA GAG CGG GAC CAC 
Lys Lys Phe Arg Glu Lys Thr Lys Asn Asn Trp Ala Glu Arg Asp His 
670 675 680 

TTT GTG TCT CAC CCG GGC AAG TAC ACA CTT ATC GAA GTA CAG GCA GAG 
Phe Val Ser His Pro Gly Lys Tyr Thr Leu lie Glu Val Gin Ala Glu 
685 690 695 

GAT GAG GCC CAG GAA GCT GTG GTG AAG GTG GAC AGA GGC CCA GTG AGG 
Asp Glu Ala Gin Glu Ala Val Val Lys Val Asp Arg Gly Pro Val Arg 
700 705 710 715 

ACT GTG ACT AAG CGG GTG CAG CCC TGC TCC CTG GAC CCA GCC ACG CAG 
Thr Val Thr Lys Arg Val Gin Pro Cys Ser Leu Asp Pro Ala Thr Gin 

720 725 730 

AAG CTC ATC ACT AAC ATC TTC AGC AAG GAG ATG TTC AAG AAC ACC ATG 
Lys Leu lie Thr Asn lie Phe Ser Lys Glu Met Phe Lys Asn Thr Met 
735 740 745 

GCC CTC ATG GAC CTG GAT GTG AAG AAG ATG CCC CTG GGA AAG CTG AGC 
Ala Leu Met Asp Leu Asp Val Lys Lys Met Pro Leu Gly Lys Leu Ser 
750 755 760 



AAG CAA CAG ATT GCA CGG GGT TTC GAG GCC TTG GAG GCG CTG GAG GAG 
Lys Gin Gin lie Ala Arg Gly Phe Glu Ala Leu Glu Ala Leu Glu Glu 
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765 770 775 

GCC CTG AAA GGC CCC ACG GAT GGT GGC CAA AGC CTG GAG GAG CTG TCC 1003
Ala Leu Lys Gly Pro Thr Asp Gly Gly Gln Ser Leu Glu Glu Leu Ser
780 785 790 795

TCA CAC TTT TAC ACC GTC ATC CCG CAC AAC TTC GGC CAC AGC CAG CCC 1051
Ser His Phe Tyr Thr Val Ile Pro His Asn Phe Gly His Ser Gln Pro
800 805 810

CCG CCC ATC AAT TCC CCT GAG CTT CTG CAG GCC AAG AAG GAC ATG CTG 1099
Pro Pro Ile Asn Ser Pro Glu Leu Leu Gln Ala Lys Lys Asp Met Leu
815 820 825 

CTG GTG CTG GCG GAC ATC GAG CTG GCC CAG GCC CTG CAG GCA GTC TCT 1147
Leu Val Leu Ala Asp Ile Glu Leu Ala Gln Ala Leu Gln Ala Val Ser
830 835 840

GAG CAG GAG AAG ACG GTG GAG GAG GTG CCA CAC CCC CTG GAC CGA GAC 1195
Glu Gln Glu Lys Thr Val Glu Glu Val Pro His Pro Leu Asp Arg Asp
845 850 855

TAC CAG CTT CTC AAG TGC CAG CTG CAG CTG CTA GAC TCT GGA GCA CCT 1243
Tyr Gln Leu Leu Lys Cys Gln Leu Gln Leu Leu Asp Ser Gly Ala Pro
860 865 870 875

GAG TAC AAG GTG ATA CAG ACC TAC TTA GAA CAG ACT GGC AGC AAC CAC 1291
Glu Tyr Lys Val Ile Gln Thr Tyr Leu Glu Gln Thr Gly Ser Asn His
880 885 890

AGG TGC CCT ACA CTT CAA CAC ATC TGG AAA GTA AAC CAA GAA GGG GAG 1339
Arg Cys Pro Thr Leu Gln His Ile Trp Lys Val Asn Gln Glu Gly Glu
895 900 905

GAA GAC AGA TTC CAG GCC CAC TCC AAA CTG GGT AAT CGG AAG CTG CTG 1387
Glu Asp Arg Phe Gln Ala His Ser Lys Leu Gly Asn Arg Lys Leu Leu
910 915 920

TGG CAT GGC ACC AAC ATG GCC GTG GTG GCC GCC ATC CTC ACT AGT GGG 1435
Trp His Gly Thr Asn Met Ala Val Val Ala Ala Ile Leu Thr Ser Gly
925 930 935

CTC CGC ATC ATG CCA CAT TCT GGT GGG CGT GTT GGC AAG GGC ATC TAC 1483
Leu Arg Ile Met Pro His Ser Gly Gly Arg Val Gly Lys Gly Ile Tyr
940 945 950 955

TTT GCC TCA GAG AAC AGC AAG TCA GCT GGA TAT GTT ATT GGC ATG AAG 1531
Phe Ala Ser Glu Asn Ser Lys Ser Ala Gly Tyr Val Ile Gly Met Lys
960 965 970

TGT GGG GCC CAC CAT GTC GGC TAC ATG TTC CTG GGT GAG GTG GCC CTG 1579
Cys Gly Ala His His Val Gly Tyr Met Phe Leu Gly Glu Val Ala Leu
975 980 985

GGC AGA GAG CAC CAT ATC AAC ACG GAC AAC CCC AGC TTG AAG AGC CCA 1627
Gly Arg Glu His His Ile Asn Thr Asp Asn Pro Ser Leu Lys Ser Pro
990 995 1000

CCT CCT GGC TTC GAC AGT GTC ATT GCC CGA GGC CAC ACC GAG CCT GAT 1675
Pro Pro Gly Phe Asp Ser Val Ile Ala Arg Gly His Thr Glu Pro Asp
1005 1010 1015

CCG ACC CAG GAC ACT GAG TTG GAG CTG GAT GGC CAG CAA GTG GTG GTG 1723
Pro Thr Gln Asp Thr Glu Leu Glu Leu Asp Gly Gln Gln Val Val Val
1020 1025 1030 1035

CCC CAG GGC CAG CCT GTG CCC TGC CCA GAG TTC AGC AGC TCC ACA TTC 1771
Pro Gln Gly Gln Pro Val Pro Cys Pro Glu Phe Ser Ser Ser Thr Phe
1040 1045 1050

TCC CAG AGC GAG TAC CTC ATC TAC CAG GAG AGC CAG TGT CGC CTG CGC 1819
Ser Gln Ser Glu Tyr Leu Ile Tyr Gln Glu Ser Gln Cys Arg Leu Arg
1055 1060 1065

TAC CTG CTG GAG GTC CAC CTC TGA GTGCCCGCCC TGTCCCCCGG GGTCCTGCAA 1873
Tyr Leu Leu Glu Val His Leu *
1070 1075

GGCTGGACTG TGATCTTCAA TCATCCTGCC CATCTCTGGT ACCCCTATAT CACTCCTTTT 1933

TTTCAAGAAT ACAATACGTT GTTGTTAACT ATAGTCACCA TGCTGTACAA GATCCCTGAA 1993

CTTATGCCTC CTAACTGAAA TTTTGTATTC TTTGACACAT CTGCCCAGTC CCTCTCCTCC 2053

CAGCCCATGG TAACCAGCAT TTGACTCTTT ACTTGTATAA GGGCAGCTTT TATAGGTTCC 2113

ACATGTAAGT GAGATCATGC AGTGTTTGTC TTTCTGTGCC TGGCTTATTT CACTCAGCAT 2173

AATGTGCACC GGGTTCACCC ATGTTTTCAT AAATGACAAG ATTTCCTCCT TTAAAAAAAA 2233

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 2265
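
The reading frame given for SEQ ID NO: 5 (CDS 221..1843) can be spot-checked by translating its first and last codons with the standard genetic code. The following is a minimal Python sketch and not part of the original listing; the names CODON_TABLE and translate are illustrative assumptions, and the two nucleotide strings are copied from positions 221 to 235 and 1835 to 1843 of the sequence above.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon into one-letter amino acids ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


cds_start = "ATG TCC CTG CTT TTC"  # nucleotides 221-235 of SEQ ID NO: 5
cds_end = "CAC CTC TGA"            # nucleotides 1835-1843 of SEQ ID NO: 5

print(translate(cds_start))  # Met Ser Leu Leu Phe in one-letter code
print(translate(cds_end))    # His Leu followed by the stop codon
```

The one-letter output corresponds to the three-letter residues printed under the codons in the listing, with the stop codon shown there as an asterisk.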



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Leu Leu Phe Leu Ala Met Ala Pro Lys Pro Lys Pro Trp Val
1 5 10 15

Gln Thr Glu Gly Pro Glu Lys Lys Lys Gly Arg Gln Ala Gly Arg Glu
20 25 30

Glu Asp Pro Phe Arg Ser Thr Ala Glu Ala Leu Lys Ala Ile Pro Ala
35 40 45

Glu Lys Arg Ile Ile Arg Val Asp Pro Thr Cys Pro Leu Ser Ser Asn
50 55 60

Pro Gly Thr Gln Val Tyr Glu Asp Tyr Asn Cys Thr Leu Asn Gln Thr
65 70 75 80

Asn Ile Glu Asn Asn Asn Asn Lys Phe Tyr Ile Ile Gln Leu Leu Gln
85 90 95

Asp Ser Asn Arg Phe Phe Thr Cys Trp Asn Arg Trp Gly Arg Val Gly
100 105 110

Glu Val Gly Gln Ser Lys Ile Asn His Phe Thr Arg Leu Glu Asp Ala
115 120 125

Lys Lys Asp Phe Glu Lys Lys Phe Arg Glu Lys Thr Lys Asn Asn Trp
130 135 140

Ala Glu Arg Asp His Phe Val Ser His Pro Gly Lys Tyr Thr Leu Ile
145 150 155 160

Glu Val Gln Ala Glu Asp Glu Ala Gln Glu Ala Val Val Lys Val Asp
165 170 175

Arg Gly Pro Val Arg Thr Val Thr Lys Arg Val Gln Pro Cys Ser Leu
180 185 190

Asp Pro Ala Thr Gln Lys Leu Ile Thr Asn Ile Phe Ser Lys Glu Met
195 200 205



Phe Lys Asn Thr Met Ala Leu Met Asp Leu Asp Val Lys Lys Met Pro
210 215 220

Leu Gly Lys Leu Ser Lys Gln Gln Ile Ala Arg Gly Phe Glu Ala Leu
225 230 235 240

Glu Ala Leu Glu Glu Ala Leu Lys Gly Pro Thr Asp Gly Gly Gln Ser
245 250 255

Leu Glu Glu Leu Ser Ser His Phe Tyr Thr Val Ile Pro His Asn Phe
260 265 270

Gly His Ser Gln Pro Pro Pro Ile Asn Ser Pro Glu Leu Leu Gln Ala
275 280 285

Lys Lys Asp Met Leu Leu Val Leu Ala Asp Ile Glu Leu Ala Gln Ala
290 295 300

Leu Gln Ala Val Ser Glu Gln Glu Lys Thr Val Glu Glu Val Pro His
305 310 315 320

Pro Leu Asp Arg Asp Tyr Gln Leu Leu Lys Cys Gln Leu Gln Leu Leu
325 330 335

Asp Ser Gly Ala Pro Glu Tyr Lys Val Ile Gln Thr Tyr Leu Glu Gln
340 345 350

Thr Gly Ser Asn His Arg Cys Pro Thr Leu Gln His Ile Trp Lys Val
355 360 365

Asn Gln Glu Gly Glu Glu Asp Arg Phe Gln Ala His Ser Lys Leu Gly
370 375 380

Asn Arg Lys Leu Leu Trp His Gly Thr Asn Met Ala Val Val Ala Ala
385 390 395 400

Ile Leu Thr Ser Gly Leu Arg Ile Met Pro His Ser Gly Gly Arg Val
405 410 415

Gly Lys Gly Ile Tyr Phe Ala Ser Glu Asn Ser Lys Ser Ala Gly Tyr
420 425 430

Val Ile Gly Met Lys Cys Gly Ala His His Val Gly Tyr Met Phe Leu
435 440 445

Gly Glu Val Ala Leu Gly Arg Glu His His Ile Asn Thr Asp Asn Pro
450 455 460

Ser Leu Lys Ser Pro Pro Pro Gly Phe Asp Ser Val Ile Ala Arg Gly
465 470 475 480

His Thr Glu Pro Asp Pro Thr Gln Asp Thr Glu Leu Glu Leu Asp Gly
485 490 495

Gln Gln Val Val Val Pro Gln Gly Gln Pro Val Pro Cys Pro Glu Phe
500 505 510

Ser Ser Ser Thr Phe Ser Gln Ser Glu Tyr Leu Ile Tyr Gln Glu Ser
515 520 525

Gln Cys Arg Leu Arg Tyr Leu Leu Glu Val His Leu *
530 535 540

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1740 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 112..1710

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CCCGGCTTTC XCTTCTTCTQ CTOCCTa *JO GAACACCTCG AGCGAACT9C TTCCTAACTC 60 

AOGOTOOOCA GAACTGAOGG OATCTAAGCT TCTOCATCTC TOAGOAGAAC C ATO OCT 117 

Met Ala 

CCA AAA CGA AAG GCC TCT GTG CAG ACT GAG GGC TCC AAG AAG CAG CGA 165
Pro Lys Arg Lys Ala Ser Val Gln Thr Glu Gly Ser Lys Lys Gln Arg
545 550 555
CAA GGG ACA GAG GAG GAG GAC AGC TTC CGG TCC ACT GCC GAG GCT CTC 213
Gln Gly Thr Glu Glu Glu Asp Ser Phe Arg Ser Thr Ala Glu Ala Leu
560 565 570 575

AGA GCA GCA CCT GCT GAT AAT CGG GTC ATC CGT GTG GAC CCC TCA TGT 261
Arg Ala Ala Pro Ala Asp Asn Arg Val Ile Arg Val Asp Pro Ser Cys
580 585 590

CCA TTC AGC CGG AAC CCC GGG ATA CAG GTC CAC GAG GAC TAT GAC TGT 309
Pro Phe Ser Arg Asn Pro Gly Ile Gln Val His Glu Asp Tyr Asp Cys
595 600 605

ACC CTG AAC CAG ACC AAC ATC GGC AAC AAC AAC AAC AAG TTC TAT ATT 357
Thr Leu Asn Gln Thr Asn Ile Gly Asn Asn Asn Asn Lys Phe Tyr Ile
610 615 620

ATC CAA CTG CTG GAG GAG GGT AGT CGC TTC TTC TGC TGG AAT CGC TGG 405
Ile Gln Leu Leu Glu Glu Gly Ser Arg Phe Phe Cys Trp Asn Arg Trp
625 630 635

GGC CGC GTG GGA GAG GTG GGC CAG AGC AAG ATG AAC CAC TTC ACC TGC 453
Gly Arg Val Gly Glu Val Gly Gln Ser Lys Met Asn His Phe Thr Cys
640 645 650 655

CTG GAA GAT GCA AAG AAG GAC TTT AAG AAG AAA TTT TGG GAG AAG ACT 501
Leu Glu Asp Ala Lys Lys Asp Phe Lys Lys Lys Phe Trp Glu Lys Thr
660 665 670

AAA AAC AAA TGG GAG GAG CGG GAC CGT TTT GTG GCC CAG CCC AAC AAG 549
Lys Asn Lys Trp Glu Glu Arg Asp Arg Phe Val Ala Gln Pro Asn Lys
675 680 685

TAC ACA CTT ATA GAA GTC CAG GGA GAA GCA GAG AGC CAA GAG GCT GTA 597
Tyr Thr Leu Ile Glu Val Gln Gly Glu Ala Glu Ser Gln Glu Ala Val
690 695 700

GTG AAG GCC TTA TCT CCC CAG GTG GAC AGC GGC CCT GTG AGG ACC GTG 645
Val Lys Ala Leu Ser Pro Gln Val Asp Ser Gly Pro Val Arg Thr Val
705 710 715

GTC AAG CCC TGC TCC CTA GAC CCT GCC ACC CAG AAC CTT ATC ACC AAC 693
Val Lys Pro Cys Ser Leu Asp Pro Ala Thr Gln Asn Leu Ile Thr Asn
720 725 730 735

ATC TTC AGC AAA GAG ATG TTC AAG AAC GCA ATG ACC CTC ATG AAC CTG 741
Ile Phe Ser Lys Glu Met Phe Lys Asn Ala Met Thr Leu Met Asn Leu
740 745 750

GAT GTG AAG AAG ATG CCC TTG GGA AAG CTG ACC AAG CAG CAG ATT GCC 789
Asp Val Lys Lys Met Pro Leu Gly Lys Leu Thr Lys Gln Gln Ile Ala
755 760 765

CGT GGC TTC GAG GCC TTG GAA GCT CTA GAG GAG GCC ATG AAA AAC CCC 837
Arg Gly Phe Glu Ala Leu Glu Ala Leu Glu Glu Ala Met Lys Asn Pro
770 775 780

ACA GGG GAT GGC CAG AGC CTG GAA GAG CTC TCC TCC TGC TTC TAC ACT 885
Thr Gly Asp Gly Gln Ser Leu Glu Glu Leu Ser Ser Cys Phe Tyr Thr
785 790 795

GTC ATC CCA CAC AAC TTC GGC CGC AGC CGA CCC CCG CCC ATC AAC TCC 933
Val Ile Pro His Asn Phe Gly Arg Ser Arg Pro Pro Pro Ile Asn Ser
800 805 810 815

CCT GAT GTG CTT CAG GCC AAG AAG GAC ATG CTG CTG GTG CTA GCG GAC 981
Pro Asp Val Leu Gln Ala Lys Lys Asp Met Leu Leu Val Leu Ala Asp
820 825 830
ATC GAG TTG GCG CAG ACC TTG CAG GCA GCC CCT GGG GAG GAG GAG GAG 1029
Ile Glu Leu Ala Gln Thr Leu Gln Ala Ala Pro Gly Glu Glu Glu Glu
835 840 845

AAA GTG GAA GAG GTG CCA CAC CCA CTG GAT CGA GAC TAC CAG CTC CTC 1077
Lys Val Glu Glu Val Pro His Pro Leu Asp Arg Asp Tyr Gln Leu Leu
850 855 860

AGG TGC CAG CTT CAA CTG CTG GAC TCC GGG GAG TCC GAG TAC AAG GCA 1125
Arg Cys Gln Leu Gln Leu Leu Asp Ser Gly Glu Ser Glu Tyr Lys Ala
865 870 875

ATA CAG ACC TAC CTG AAA CAG ACT GGC AAC AGC TAC AGG TGC CCA AAC 1173
Ile Gln Thr Tyr Leu Lys Gln Thr Gly Asn Ser Tyr Arg Cys Pro Asn
880 885 890 895

CTG CGG CAT GTT TGG AAA GTG AAC CGA GAA GGG GAG GGA GAC AGG TTC 1221
Leu Arg His Val Trp Lys Val Asn Arg Glu Gly Glu Gly Asp Arg Phe
900 905 910

CAG GCC CAC TCC AAA CTG GGC AAT CGG AGG CTG CTG TGG CAC GGC ACC 1269
Gln Ala His Ser Lys Leu Gly Asn Arg Arg Leu Leu Trp His Gly Thr
915 920 925

AAT GTG GCC GTG GTG GCT GCC ATC CTC ACC ACT GGG CTC CGA ATC ATG 1317
Asn Val Ala Val Val Ala Ala Ile Leu Thr Ser Gly Leu Arg Ile Met
930 935 940

CCA CAC TCG GGT GGT CGT GTT GGC AAG GGT ATT TAT TTT GCC TCT GAG 1365
Pro His Ser Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe Ala Ser Glu
945 950 955

AAC AGC AAG TCA GCT GGC TAT GTT ACC ACC ATG CAC TGT GGG GGC CAC 1413
Asn Ser Lys Ser Ala Gly Tyr Val Thr Thr Met His Cys Gly Gly His
960 965 970 975

CAG GTG GGC TAC ATG TTC CTG GGC GAG GTG GCC CTC GGC AAA GAG CAC 1461
Gln Val Gly Tyr Met Phe Leu Gly Glu Val Ala Leu Gly Lys Glu His
980 985 990

CAC ATC ACC ATC GAT GAC CCC AGC TTG AAG AGT CCA CCC CCT GGC TTT 1509
His Ile Thr Ile Asp Asp Pro Ser Leu Lys Ser Pro Pro Pro Gly Phe
995 1000 1005

GAC AGC GTC ATC GCC CGA GGC CAA ACC GAG CCG GAT CCC GCC CAG GAC 1557
Asp Ser Val Ile Ala Arg Gly Gln Thr Glu Pro Asp Pro Ala Gln Asp
1010 1015 1020

ATT GAA CTT GAA CTG GAT GGG CAG CCG GTG GTG GTG CCC CAA GGC CCG 1605
Ile Glu Leu Glu Leu Asp Gly Gln Pro Val Val Val Pro Gln Gly Pro
1025 1030 1035

CCT GTG CAG TGC CCG TCA TTC AAA AGC TCC AGC TTC AGC CAG AGT GAA 1653
Pro Val Gln Cys Pro Ser Phe Lys Ser Ser Ser Phe Ser Gln Ser Glu
1040 1045 1050 1055

TAC CTC ATA TAC AAG GAG AGC CAG TGT CGC CTG CGC TAC CTG CTG GAG 1701
Tyr Leu Ile Tyr Lys Glu Ser Gln Cys Arg Leu Arg Tyr Leu Leu Glu
1060 1065 1070

ATT CAC CTC TAAGCTGCTT GCCCTCCCTA GGTCCAAGCC 1740
Ile His Leu
(2) INFORMATION FOR SEQ ID NO: 8:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Pro Lys Arg Lys Ala Ser Val Gln Thr Glu Gly Ser Lys Lys
1 5 10 15

Gln Arg Gln Gly Thr Glu Glu Glu Asp Ser Phe Arg Ser Thr Ala Glu
20 25 30

Ala Leu Arg Ala Ala Pro Ala Asp Asn Arg Val Ile Arg Val Asp Pro
35 40 45

Ser Cys Pro Phe Ser Arg Asn Pro Gly Ile Gln Val His Glu Asp Tyr
50 55 60

Asp Cys Thr Leu Asn Gln Thr Asn Ile Gly Asn Asn Asn Asn Lys Phe
65 70 75 80

Tyr Ile Ile Gln Leu Leu Glu Glu Gly Ser Arg Phe Phe Cys Trp Asn
85 90 95

Arg Trp Gly Arg Val Gly Glu Val Gly Gln Ser Lys Met Asn His Phe
100 105 110

Thr Cys Leu Glu Asp Ala Lys Lys Asp Phe Lys Lys Lys Phe Trp Glu
115 120 125

Lys Thr Lys Asn Lys Trp Glu Glu Arg Asp Arg Phe Val Ala Gln Pro
130 135 140

Asn Lys Tyr Thr Leu Ile Glu Val Gln Gly Glu Ala Glu Ser Gln Glu
145 150 155 160

Ala Val Val Lys Ala Leu Ser Pro Gln Val Asp Ser Gly Pro Val Arg
165 170 175

Thr Val Val Lys Pro Cys Ser Leu Asp Pro Ala Thr Gln Asn Leu Ile
180 185 190

Thr Asn Ile Phe Ser Lys Glu Met Phe Lys Asn Ala Met Thr Leu Met
195 200 205

Asn Leu Asp Val Lys Lys Met Pro Leu Gly Lys Leu Thr Lys Gln Gln
210 215 220

Ile Ala Arg Gly Phe Glu Ala Leu Glu Ala Leu Glu Glu Ala Met Lys
225 230 235 240

Asn Pro Thr Gly Asp Gly Gln Ser Leu Glu Glu Leu Ser Ser Cys Phe
245 250 255

Tyr Thr Val Ile Pro His Asn Phe Gly Arg Ser Arg Pro Pro Pro Ile
260 265 270

Asn Ser Pro Asp Val Leu Gln Ala Lys Lys Asp Met Leu Leu Val Leu
275 280 285
Ala Asp Ile Glu Leu Ala Gln Thr Leu Gln Ala Ala Pro Gly Glu Glu
290 295 300

Glu Glu Lys Val Glu Glu Val Pro His Pro Leu Asp Arg Asp Tyr Gln
305 310 315 320

Leu Leu Arg Cys Gln Leu Gln Leu Leu Asp Ser Gly Glu Ser Glu Tyr
325 330 335

Lys Ala Ile Gln Thr Tyr Leu Lys Gln Thr Gly Asn Ser Tyr Arg Cys
340 345 350

Pro Asn Leu Arg His Val Trp Lys Val Asn Arg Glu Gly Glu Gly Asp
355 360 365

Arg Phe Gln Ala His Ser Lys Leu Gly Asn Arg Arg Leu Leu Trp His
370 375 380

Gly Thr Asn Val Ala Val Val Ala Ala Ile Leu Thr Ser Gly Leu Arg
385 390 395 400

Ile Met Pro His Ser Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe Ala
405 410 415

Ser Glu Asn Ser Lys Ser Ala Gly Tyr Val Thr Thr Met His Cys Gly
420 425 430

Gly His Gln Val Gly Tyr Met Phe Leu Gly Glu Val Ala Leu Gly Lys
435 440 445

Glu His His Ile Thr Ile Asp Asp Pro Ser Leu Lys Ser Pro Pro Pro
450 455 460

Gly Phe Asp Ser Val Ile Ala Arg Gly Gln Thr Glu Pro Asp Pro Ala
465 470 475 480

Gln Asp Ile Glu Leu Glu Leu Asp Gly Gln Pro Val Val Val Pro Gln
485 490 495

Gly Pro Pro Val Gln Cys Pro Ser Phe Lys Ser Ser Ser Phe Ser Gln
500 505 510

Ser Glu Tyr Leu Ile Tyr Lys Glu Ser Gln Cys Arg Leu Arg Tyr Leu
515 520 525

Leu Glu Ile His Leu
530



(2) INFORMATION FOR SEQ ID NO: 9:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1587 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTISENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mus musculus 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1584

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG GCT CCA AAA CGA AAG GCC TCT GTG CAG ACT GAG GGC TCC AAG AAG 48
Met Ala Pro Lys Arg Lys Ala Ser Val Gln Thr Glu Gly Ser Lys Lys
535 540 545

CAG CGA CAA GGG ACA GAG GAG GAG GAC AGC TTC CGG TCC ACT GCC GAG 96
Gln Arg Gln Gly Thr Glu Glu Glu Asp Ser Phe Arg Ser Thr Ala Glu
550 555 560 565

GCT CTC AGA GCA GCA CCT GCT GAT AAT CGG GTC ATC CGT GTG GAC CCC 144
Ala Leu Arg Ala Ala Pro Ala Asp Asn Arg Val Ile Arg Val Asp Pro
570 575 580

TCA TGT CCA TTC AGC CGG AAC CCC GGG ATA CAG GTC CAC GAG GAC TAT 192
Ser Cys Pro Phe Ser Arg Asn Pro Gly Ile Gln Val His Glu Asp Tyr
585 590 595

GAC TGT ACC CTG AAC CAG ACC AAC ATC GGC AAC AAC AAC AAC AAG TTC 240
Asp Cys Thr Leu Asn Gln Thr Asn Ile Gly Asn Asn Asn Asn Lys Phe
600 605 610

TAT ATT ATC CAA CTG CTG GAG GAG GGT AGT CGC TTC TTC TGC TGG AAT 288
Tyr Ile Ile Gln Leu Leu Glu Glu Gly Ser Arg Phe Phe Cys Trp Asn
615 620 625



CGC TGG GGC CGC GTG GGA GAG GTG GGC CAG AGC AAG ATG AAC CAC TTC 336
Arg Trp Gly Arg Val Gly Glu Val Gly Gln Ser Lys Met Asn His Phe
630 635 640 645

ACC TGC CTG GAA GAT GCA AAG AAG GAC TTT AAG AAG AAA TTT TGG GAG 384
Thr Cys Leu Glu Asp Ala Lys Lys Asp Phe Lys Lys Lys Phe Trp Glu
650 655 660

AAG ACT AAA AAC AAA TGG GAG GAG CGG GAC CGT TTT GTG GCC CAG CCC 432
Lys Thr Lys Asn Lys Trp Glu Glu Arg Asp Arg Phe Val Ala Gln Pro
665 670 675

AAC AAG TAC ACA CTT ATA GAA GTC CAG GGA GAA GCA GAG AGC CAA GAG 480
Asn Lys Tyr Thr Leu Ile Glu Val Gln Gly Glu Ala Glu Ser Gln Glu
680 685 690

GCT GTA GTG AAG GTG GAC AGC GGC CCT GTG AGG ACC GTG GTC AAG CCC 528
Ala Val Val Lys Val Asp Ser Gly Pro Val Arg Thr Val Val Lys Pro
695 700 705

TGC TCC CTA GAC CCT GCC ACC CAG AAC CTT ATC ACC AAC ATC TTC AGC 576
Cys Ser Leu Asp Pro Ala Thr Gln Asn Leu Ile Thr Asn Ile Phe Ser
710 715 720 725

AAA GAG ATG TTC AAG AAC GCA ATG ACC CTC ATG AAC CTG GAT GTG AAG 624
Lys Glu Met Phe Lys Asn Ala Met Thr Leu Met Asn Leu Asp Val Lys
730 735 740

AAG ATG CCC TTG GGA AAG CTG ACC AAG CAG CAG ATT GCC CGT GGC TTC 672
Lys Met Pro Leu Gly Lys Leu Thr Lys Gln Gln Ile Ala Arg Gly Phe
745 750 755

GAG GCC TTG GAA GCT CTA GAG GAG GCC ATG AAA AAC CCC ACA GGG GAT 720
Glu Ala Leu Glu Ala Leu Glu Glu Ala Met Lys Asn Pro Thr Gly Asp
760 765 770
GGC CAG AGC CTG GAA GAG CTC TCC TCC TGC TTC TAC ACT GTC ATC CCA 768
Gly Gln Ser Leu Glu Glu Leu Ser Ser Cys Phe Tyr Thr Val Ile Pro
775 780 785

CAC AAC TTC GGC CGC AGC CGA CCC CCG CCC ATC AAC TCC CCT GAT GTG 816
His Asn Phe Gly Arg Ser Arg Pro Pro Pro Ile Asn Ser Pro Asp Val
790 795 800 805

CTT CAG GCC AAG AAG GAC ATG CTG CTG GTG CTA GCG GAC ATC GAG TTG 864
Leu Gln Ala Lys Lys Asp Met Leu Leu Val Leu Ala Asp Ile Glu Leu
810 815 820

GCG CAG ACC TTG CAG GCA GCC CCT GGG GAG GAG GAG GAG AAA GTG GAA 912
Ala Gln Thr Leu Gln Ala Ala Pro Gly Glu Glu Glu Glu Lys Val Glu
825 830 835

GAG GTG CCA CAC CCA CTG GAT CGA GAC TAC CAG CTC CTC AGG TGC CAG 960
Glu Val Pro His Pro Leu Asp Arg Asp Tyr Gln Leu Leu Arg Cys Gln
840 845 850

CTT CAA CTG CTG GAC TCC GGG GAG TCC GAG TAC AAG GCA ATA CAG ACC 1008
Leu Gln Leu Leu Asp Ser Gly Glu Ser Glu Tyr Lys Ala Ile Gln Thr
855 860 865

TAC CTG AAA CAG ACT GGC AAC AGC TAC AGG TGC CCA AAC CTG CGG CAT 1056
Tyr Leu Lys Gln Thr Gly Asn Ser Tyr Arg Cys Pro Asn Leu Arg His
870 875 880 885

GTT TGG AAA GTG AAC CGA GAA GGG GAG GGA GAC AGG TTC CAG GCC CAC 1104
Val Trp Lys Val Asn Arg Glu Gly Glu Gly Asp Arg Phe Gln Ala His
890 895 900

TCC AAA CTG GGC AAT CGG AGG CTG CTG TGG CAC GGC ACC AAT GTG GCC 1152
Ser Lys Leu Gly Asn Arg Arg Leu Leu Trp His Gly Thr Asn Val Ala
905 910 915

GTG GTG GCT GCC ATC CTC ACC ACT GGG CTC CGA ATC ATG CCA CAC TCG 1200
Val Val Ala Ala Ile Leu Thr Ser Gly Leu Arg Ile Met Pro His Ser
920 925 930

GGT GGT CGT GTT GGC AAG GGT ATT TAT TTT GCC TCT GAG AAC AGC AAG 1248
Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe Ala Ser Glu Asn Ser Lys
935 940 945

TCA GCT GGC TAT GTT ACC ACC ATG CAC TGT GGG GGC CAC CAG GTG GGC 1296
Ser Ala Gly Tyr Val Thr Thr Met His Cys Gly Gly His Gln Val Gly
950 955 960 965

TAC ATG TTC CTG GGC GAG GTG GCC CTC GGC AAA GAG CAC CAC ATC ACC 1344
Tyr Met Phe Leu Gly Glu Val Ala Leu Gly Lys Glu His His Ile Thr
970 975 980

ATC GAT GAC CCC AGC TTG AAG AGT CCA CCC CCT GGC TTT GAC AGC GTC 1392
Ile Asp Asp Pro Ser Leu Lys Ser Pro Pro Pro Gly Phe Asp Ser Val
985 990 995

ATC GCC CGA GGC CAA ACC GAG CCG GAT CCC GCC CAG GAC ATT GAA CTT 1440
Ile Ala Arg Gly Gln Thr Glu Pro Asp Pro Ala Gln Asp Ile Glu Leu
1000 1005 1010

GAA CTG GAT GGG CAG CCG GTG GTG GTG CCC CAA GGC CCG CCT GTG CAG 1488
Glu Leu Asp Gly Gln Pro Val Val Val Pro Gln Gly Pro Pro Val Gln
1015 1020 1025

TGC CCG TCA TTC AAA AGC TCC AGC TTC AGC CAG AGT GAA TAC CTC ATA 1536
Cys Pro Ser Phe Lys Ser Ser Ser Phe Ser Gln Ser Glu Tyr Leu Ile
1030 1035 1040 1045

TAC AAG GAG AGC CAG TGT CGC CTG CGC TAC CTG CTG GAG ATT CAC CTC 1584
Tyr Lys Glu Ser Gln Cys Arg Leu Arg Tyr Leu Leu Glu Ile His Leu
1050 1055 1060

TAA 1587
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Pro Lys Arg Lys Ala Ser Val Gln Thr Glu Gly Ser Lys Lys
1 5 10 15

Gln Arg Gln Gly Thr Glu Glu Glu Asp Ser Phe Arg Ser Thr Ala Glu
20 25 30

Ala Leu Arg Ala Ala Pro Ala Asp Asn Arg Val Ile Arg Val Asp Pro
35 40 45

Ser Cys Pro Phe Ser Arg Asn Pro Gly Ile Gln Val His Glu Asp Tyr
50 55 60

Asp Cys Thr Leu Asn Gln Thr Asn Ile Gly Asn Asn Asn Asn Lys Phe
65 70 75 80

Tyr Ile Ile Gln Leu Leu Glu Glu Gly Ser Arg Phe Phe Cys Trp Asn
85 90 95

Arg Trp Gly Arg Val Gly Glu Val Gly Gln Ser Lys Met Asn His Phe
100 105 110

Thr Cys Leu Glu Asp Ala Lys Lys Asp Phe Lys Lys Lys Phe Trp Glu
115 120 125

Lys Thr Lys Asn Lys Trp Glu Glu Arg Asp Arg Phe Val Ala Gln Pro
130 135 140

Asn Lys Tyr Thr Leu Ile Glu Val Gln Gly Glu Ala Glu Ser Gln Glu
145 150 155 160

Ala Val Val Lys Val Asp Ser Gly Pro Val Arg Thr Val Val Lys Pro
165 170 175

Cys Ser Leu Asp Pro Ala Thr Gln Asn Leu Ile Thr Asn Ile Phe Ser
180 185 190

Lys Glu Met Phe Lys Asn Ala Met Thr Leu Met Asn Leu Asp Val Lys
195 200 205

Lys Met Pro Leu Gly Lys Leu Thr Lys Gln Gln Ile Ala Arg Gly Phe
210 215 220

Glu Ala Leu Glu Ala Leu Glu Glu Ala Met Lys Asn Pro Thr Gly Asp
225 230 235 240

Gly Gln Ser Leu Glu Glu Leu Ser Ser Cys Phe Tyr Thr Val Ile Pro
245 250 255

His Asn Phe Gly Arg Ser Arg Pro Pro Pro Ile Asn Ser Pro Asp Val
260 265 270

Leu Gln Ala Lys Lys Asp Met Leu Leu Val Leu Ala Asp Ile Glu Leu
275 280 285






Ala Gln Thr Leu Gln Ala Ala Pro Gly Glu Glu Glu Glu Lys Val Glu
290 295 300

Glu Val Pro His Pro Leu Asp Arg Asp Tyr Gln Leu Leu Arg Cys Gln
305 310 315 320

Leu Gln Leu Leu Asp Ser Gly Glu Ser Glu Tyr Lys Ala Ile Gln Thr
325 330 335

Tyr Leu Lys Gln Thr Gly Asn Ser Tyr Arg Cys Pro Asn Leu Arg His
340 345 350

Val Trp Lys Val Asn Arg Glu Gly Glu Gly Asp Arg Phe Gln Ala His
355 360 365

Ser Lys Leu Gly Asn Arg Arg Leu Leu Trp His Gly Thr Asn Val Ala
370 375 380

Val Val Ala Ala Ile Leu Thr Ser Gly Leu Arg Ile Met Pro His Ser
385 390 395 400



Gly Gly Arg Val Gly Lys Gly Ile Tyr Phe Ala Ser Glu Asn Ser Lys
405 410 415

Ser Ala Gly Tyr Val Thr Thr Met His Cys Gly Gly His Gln Val Gly
420 425 430

Tyr Met Phe Leu Gly Glu Val Ala Leu Gly Lys Glu His His Ile Thr
435 440 445

Ile Asp Asp Pro Ser Leu Lys Ser Pro Pro Pro Gly Phe Asp Ser Val
450 455 460

Ile Ala Arg Gly Gln Thr Glu Pro Asp Pro Ala Gln Asp Ile Glu Leu
465 470 475 480

Glu Leu Asp Gly Gln Pro Val Val Val Pro Gln Gly Pro Pro Val Gln
485 490 495

Cys Pro Ser Phe Lys Ser Ser Ser Phe Ser Gln Ser Glu Tyr Leu Ile
500 505 510

Tyr Lys Glu Ser Gln Cys Arg Leu Arg Tyr Leu Leu Glu Ile His Leu
515 520 525
We claim:

1. A poly(ADP-ribose) polymerase (PARP) homolog which has an
amino acid sequence which has

a) a functional NAD+ binding domain and

b) no zinc finger sequence motif of the general formula



in which

m is an integral value of 28 or 30, and the X radicals
are, independently of one another, any amino acid;
and the functional equivalents thereof.

2. A PARP homolog as claimed in claim 1, wherein the functional
NAD+ binding domain comprises one of the following general
sequence motifs:

PXn(S/T)GX3GKGIYFA,

(S/T)XGLR(I/V)XPXn(S/T)GX3GKGIYFA or

LLWHG(S/T)X7IL(S/T)XGLR(I/V)XPXn(S/T)GX3GKGIYFAX3SKSAXY
in which

n is an integral value from 1 to 5, and the X radicals are,
independently of one another, any amino acid.

3. A PARP homolog as claimed in either of the preceding claims,
comprising at least another one of the following
part-sequence motifs:

LX9NX2YX2QLLX(D/E)X10/11WGRVG,
AX3FXKX4KTXNXWX5FX3PXK,
QXL(I/L)X2IX9MX10PLGKLX3QIX6L,
FYTXIPHXFGX3PP; and
KX3LX2LXDIEXAX2L,

in which the X radicals are, independently of one another,
any amino acid.

4. A PARP homolog as claimed in any of the preceding claims,
selected from human PARP homologs, which has the amino acid
sequence shown in SEQ ID NO: 2 (human PARP2) or SEQ ID NO: 4
or 6 (human PARP3 type 1 or 2); or murine PARP homologs which
have the amino acid sequence shown in SEQ ID NO: 8 (mouse PARP
long form) or SEQ ID NO: 10 (mouse PARP short form); and the
functional equivalents thereof.
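
The general sequence motifs recited in claim 2 can be read as search patterns over a one-letter amino acid sequence. The following Python sketch is an illustration only and forms no part of the claims; it assumes that each X radical may be written as any capital letter, that Xn means one to five residues, and that the names ANY, MOTIFS and find_motifs are hypothetical. The test fragment is residues 388 to 432 of SEQ ID NO: 6, transcribed into one-letter code.

```python
import re

ANY = "[A-Z]"  # an "X radical": any amino acid in one-letter code

# Regular-expression renderings of the three motifs of claim 2 (n = 1 to 5).
MOTIFS = {
    "short": f"P{ANY}{{1,5}}[ST]G{ANY}{{3}}GKGIYFA",
    "medium": f"[ST]{ANY}GLR[IV]{ANY}P{ANY}{{1,5}}[ST]G{ANY}{{3}}GKGIYFA",
    "long": (
        f"LLWHG[ST]{ANY}{{7}}IL[ST]{ANY}GLR[IV]{ANY}P{ANY}{{1,5}}"
        f"[ST]G{ANY}{{3}}GKGIYFA{ANY}{{3}}SKSA{ANY}Y"
    ),
}


def find_motifs(protein: str) -> dict:
    """Return the first match of each motif in the sequence, or None if absent."""
    hits = {}
    for name, pattern in MOTIFS.items():
        match = re.search(pattern, protein)
        hits[name] = match.group(0) if match else None
    return hits


# Residues 388-432 of SEQ ID NO: 6 (Leu Leu Trp His Gly ... Ser Ala Gly Tyr).
fragment = "LLWHGTNMAVVAAILTSGLRIMPHSGGRVGKGIYFASENSKSAGY"
hits = find_motifs(fragment)
for name, hit in hits.items():
    print(name, hit)
```

A sequence lacking the motif simply yields None for that entry, so the same helper could be run over each protein of the listing in turn.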

5. A binding partner for PARP homologs as claimed in any of the
preceding claims, selected from

a) antibodies and fragments thereof,

b) protein-like compounds which interact with a
part-sequence of the protein, and

c) low molecular weight effectors which modulate the
catalytic PARP activity or another biological function of
a PARP molecule.

6. A nucleic acid comprising

a) a nucleotide sequence coding for at least one PARP
homolog as claimed in any of claims 1 to 4, or the
complementary nucleotide sequence thereof;

b) a nucleotide sequence which hybridizes with a sequence as
specified in a) under stringent conditions; or

c) nucleotide sequences which are derived from the
nucleotide sequences defined in a) and b) through the
degeneracy of the genetic code.
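
The degeneracy referred to in part c) can be made concrete by counting how many different nucleotide sequences encode one and the same peptide. The Python sketch below is an illustration only; it assumes the standard genetic code, and the names SYNONYMS and count_encodings are hypothetical. The example peptide is the first five residues of SEQ ID NO: 6 in one-letter code.

```python
from collections import Counter
from math import prod

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Number of synonymous codons available for each amino acid (and for stop, '*').
SYNONYMS = Counter(CODON_TABLE.values())


def count_encodings(peptide: str) -> int:
    """Count the distinct coding sequences that translate to the given peptide."""
    return prod(SYNONYMS[residue] for residue in peptide)


peptide = "MSLLF"  # Met Ser Leu Leu Phe, residues 1-5 of SEQ ID NO: 6
print(count_encodings(peptide))  # 1 * 6 * 6 * 6 * 2 alternative coding sequences
```

Even this five-residue stretch admits several hundred coding sequences, which is why the claim is framed in terms of sequences derived through degeneracy rather than a single nucleotide sequence.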

7. A nucleic acid as claimed in claim 6, comprising

a) nucleotides +3 to +1715 shown in SEQ ID NO: 1;

b) nucleotides +242 to +1843 shown in SEQ ID NO: 3;

c) nucleotides +221 to +1843 shown in SEQ ID NO: 5;

d) nucleotides +112 to +1710 shown in SEQ ID NO: 7; or

e) nucleotides +1 to +1584 shown in SEQ ID NO: 9.
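
The nucleotide ranges recited in claim 7 can be checked for internal consistency: each should span a whole number of codons. The Python sketch below is an illustration only, with hypothetical names CDS_RANGES and codon_count; the coordinates are copied from the claim, and positions are taken to be 1-based and inclusive.

```python
# Coding ranges from claim 7, keyed by SEQ ID NO (1-based, inclusive positions).
CDS_RANGES = {
    1: (3, 1715),
    3: (242, 1843),
    5: (221, 1843),
    7: (112, 1710),
    9: (1, 1584),
}


def codon_count(start: int, end: int) -> int:
    """Return the number of codons in an inclusive nucleotide range."""
    length = end - start + 1
    if length % 3:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


CODONS = {seq_id: codon_count(*span) for seq_id, span in CDS_RANGES.items()}
for seq_id, codons in CODONS.items():
    print(f"SEQ ID NO: {seq_id}: {codons} codons")
```

For SEQ ID NO: 5, 7 and 9 the resulting codon counts agree with the lengths of 541, 533 and 528 stated in the listing for SEQ ID NO: 6, 8 and 10; the figure for SEQ ID NO: 6 counts the terminal stop, which the listing prints as an asterisk.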


8. An expression cassette comprising, under the genetic control 
of at least one regulatory nucleotide sequence, at least one 
nucleotide sequence as claimed in either of claims 6 and 7. 

9. A recombinant vector comprising at least one expression
cassette as claimed in claim 8. 

10. A recombinant microorganism comprising at least one 
recombinant vector as claimed in claim 9. 


11. A transgenic mammal comprising a vector as claimed in 
claim 9. 



12. A PARP-deficient mammal or PARP-deficient eukaryotic cell, in
which functional expression of at least one gene which codes
for a PARP homolog as claimed in any of claims 1 to 4 is
inhibited.

13. An in vitro detection method for PARP inhibitors, which
comprises

a) incubating an unsupported or supported
polyADP-ribosylatable target with a reaction mixture
comprising

a1) a PARP homolog as claimed in any of claims 1 to 4,
a2) a PARP activator; and
a3) a PARP inhibitor or an analyte in which at least one
PARP inhibitor is suspected;

b) carrying out the polyADP ribosylation reaction; and

c) determining the polyADP ribosylation of the target
qualitatively or quantitatively.
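
Where step c) is carried out quantitatively, one common way of expressing the result is as percent inhibition relative to an uninhibited control and a background reading. The Python sketch below is an illustration only and is not taken from the claims; the normalization, the function name percent_inhibition and the example readings are all assumptions, and the numbers are hypothetical rather than measured data.

```python
def percent_inhibition(signal: float, uninhibited: float, background: float) -> float:
    """Express an assay signal as percent inhibition.

    signal      -- reading with the test analyte present
    uninhibited -- reading of the reaction without inhibitor (0 % inhibition)
    background  -- reading without enzyme activity (100 % inhibition)
    """
    window = uninhibited - background
    if window <= 0:
        raise ValueError("uninhibited signal must exceed background")
    return 100.0 * (1.0 - (signal - background) / window)


# Hypothetical readings in arbitrary units, chosen only to show the arithmetic.
example = percent_inhibition(signal=550.0, uninhibited=1000.0, background=100.0)
print(round(example, 1))
```

A qualitative read-out would instead compare the signal against a chosen threshold between the two control readings.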

14. A method as claimed in claim 13, wherein the PARP homolog is
preincubated with the PARP activator and the PARP inhibitor
or an analyte in which at least one PARP inhibitor is
suspected, before the polyADP ribosylation reaction is
carried out.

15. A method as claimed in either of claims 13 and 14, wherein
the polyADP-ribosylatable target is a histone protein.

16. A method as claimed in any of claims 13 to 15, wherein the
PARP activator is activated DNA.

17. A method as claimed in any of claims 13 to 16, wherein the
polyADP ribosylation reaction is started by adding NAD+.

18. A method as claimed in any of claims 13 to 17, wherein the
polyADP ribosylation of the supported target is determined
using anti-poly(ADP-ribose) antibodies.

19. A method as claimed in any of claims 13 to 17, wherein the
unsupported target is labeled with an acceptor fluorophore.

20. A method as claimed in claim 19, wherein the polyADP
ribosylation of the unsupported target is determined using
anti-poly(ADP-ribose) antibody which is labeled with a donor
fluorophore which is able to transfer energy to the acceptor
fluorophore.

21. A method as claimed in either of claims 19 and 20, wherein
the target is biotinylated histone, and the acceptor
fluorophore is coupled thereto via avidin or streptavidin.

22. A method as claimed in either of claims 20 and 21, wherein
the anti-poly(ADP-ribose) antibody carries a europium
cryptate as donor fluorophore.

23. An in vitro screening method for binding partners for a PARP
molecule, which comprises

a1) immobilizing at least one PARP homolog as claimed in any
of claims 1 to 4 on a support;
b1) contacting the immobilized PARP homolog with an analyte
in which at least one binding partner is suspected; and
c1) determining, where appropriate after an incubation
period, analyte constituents bound to the immobilized
PARP homolog;

or

a2) immobilizing on a support an analyte which comprises at
least one possible binding partner for a PARP molecule;
b2) contacting the immobilized analyte with at least one PARP
homolog as claimed in any of claims 1 to 4 for which a
binding partner is sought; and
c2) examining the immobilized analyte, where appropriate
after an incubation period, for binding of the PARP
homolog.

24. A method for the qualitative or quantitative determination of
nucleic acids encoding a PARP homolog as claimed in any of
claims 1 to 4, which comprises

a) incubating a biological sample with a defined amount of
an exogenous nucleic acid as claimed in either of claims
6 and 7, hybridizing under stringent conditions,
determining the hybridizing nucleic acids and, where
appropriate, comparing with a standard; or

b) incubating a biological sample with a pair of
oligonucleotide primers with specificity for a PARP
homolog-encoding nucleic acid, amplifying the nucleic
acid, determining the amplification product and, where
appropriate, comparing with a standard.

25. A method for the qualitative or quantitative determination of
a PARP homolog as claimed in any of claims 1 to 4, which
comprises

a) incubating a biological sample with a binding partner
specific for a PARP homolog,

b) detecting the binding partner/PARP complex and, where
appropriate,

c) comparing the result with a standard.

26. A method as claimed in claim 25, wherein the binding partner 
is an antibody or a binding fragment thereof, which carries a 
detectable label where appropriate. 

27. A method as claimed in any of claims 24 to 26 for diagnosing
energy deficit-mediated illnesses.

28. A method for determining the efficacy of PARP effectors,
which comprises

a) incubating a PARP homolog as claimed in any of claims 1
to 4 with an analyte which comprises an effector of a
physiological or pathological PARP activity; removing the
effector again where appropriate; and

b) determining the activity of the PARP homolog, where
appropriate after adding substrates or cosubstrates.

29. A gene therapy composition, which comprises in a vehicle
acceptable for gene therapy a nucleic acid construct which

a) comprises an antisense nucleic acid against a coding
nucleic acid as claimed in either of claims 6 and 7; or

b) comprises a ribozyme against a nucleic acid as claimed in
either of claims 6 and 7; or

c) codes for a specific PARP inhibitor.

30. A pharmaceutical composition comprising, in a
pharmaceutically acceptable vehicle, at least one PARP
protein as claimed in any of claims 1 to 4, at least one PARP
binding partner as claimed in claim 5 or at least one coding
nucleotide sequence as claimed in claim 6 or 7.

31. The use of low molecular weight PARP binding partners as
claimed in claim 5 for the diagnosis or therapy of
pathological states in the development and/or progress of
which at least one PARP protein, or a polypeptide derived
therefrom, is involved.

32. The use of low molecular weight PARP binding partners as
claimed in claim 5 for the diagnosis or therapy of
pathological states mediated by an energy deficit.

Abstract

The invention relates to poly(ADP-ribose) polymerase (PARP)
homologs which have an amino acid sequence which has

a) a functional NAD+ binding domain
and
b) no zinc finger sequence motif of the general formula

CX₂CXₘHX₂C

in which
m is an integer value of 28 or 30, and the X radicals are,
independently of one another, any amino acid;

and the functional equivalents thereof; nucleic acids coding
therefor; antibodies with specificity for the novel protein;
pharmaceutical and gene therapy compositions which comprise
products according to the invention; methods for the analytical
determination of the proteins and nucleic acids according to the
invention; methods for identifying effectors or binding partners
of the proteins according to the invention; novel PARP effectors;
and methods for determining the activity of such effectors.
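
The sequence criterion in feature b) can be illustrated with a short pattern search. The following Python sketch is not part of the invention as described; it merely assumes that a protein is given as a string in the one-letter amino acid code, reads each X as exactly one arbitrary residue, and tests for the motif C-X2-C-Xm-H-X2-C with m = 28 or 30. The function names and the example sequences are hypothetical and chosen for illustration only.

```python
import re
from typing import List

# Motif C-X2-C-Xm-H-X2-C with m = 28 or 30; X is read as any single
# residue (any letter A-Z, an assumption of this sketch). The lookahead
# allows overlapping occurrences to be reported as well.
ZINC_FINGER_MOTIF = re.compile(
    r"(?=(C[A-Z]{2}C(?:[A-Z]{28}|[A-Z]{30})H[A-Z]{2}C))"
)


def find_zinc_finger_motifs(sequence: str) -> List[int]:
    """Return the 0-based start positions of the motif in the sequence."""
    return [m.start() for m in ZINC_FINGER_MOTIF.finditer(sequence.upper())]


def lacks_zinc_finger_motif(sequence: str) -> bool:
    """True if the sequence contains no occurrence of the motif."""
    return not find_zinc_finger_motifs(sequence)


if __name__ == "__main__":
    # Hypothetical test sequences, not taken from any real protein.
    with_motif = "MK" + "C" + "GG" + "C" + "G" * 28 + "H" + "GG" + "C"
    without_motif = "MK" + "C" + "GG" + "C" + "G" * 29 + "H" + "GG" + "C"
    print(find_zinc_finger_motifs(with_motif))
    print(lacks_zinc_finger_motif(without_motif))
```

Such a check covers only the sequence-motif criterion; whether the NAD+ binding domain of feature a) is functional cannot be decided by a pattern search of this kind.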